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Docket No. PT145/01.PRO 
CRYSTALS AND STRUCTURES OF p21-ACTIVATED KINASE 6 



INTRODUCTION 

[0001] The present invention concerns crystalline forms of polypeptides that 
correspond to the kinase domain of p21 -activated kinase 6 (PAK6KD), methods of 
obtaining such crystals, and to the high-resolution X-ray diffraction structures and 
molecular structure coordinates obtained therefrom. The crystals of the invention and the 
atomic structural information obtained therefrom are useful, for example, for solving the 
crystal and solution structures of related and unrelated proteins, for screening for, 
identifying, and/or designing protein analogues and modified proteins, and for screening 
for, identifying and/or designing compounds that bind to and/or modulate a biological 
activity of PAK6, including inhibitors and activators of PAK6 activity. 
[0002] PAK6 (p21 -activated kinase 6) is a kinase in the STE group of serine/threonine 
kinases. It is a member of the group II PAKs, which comprises PAK4, PAK5, and PAK6 
(Jaffer, Z. M., and Chernoff, J., 7. Biochem Cell Bio., 34:713-717, 2002). The PAKs 
contain an N-terminal p21-Rho-binding domain and a C-terminal catalytic protein kinase 
domain. Both PAK groups have been implicated in actin reorganization, gene 
transcription, and apoptosis, while the group II PAKs have also been implicated in cell 
transformation and hormone signaling (Jaffer and Chernoff, 2002). PAK6 interacts with 
at least two nuclear hormone receptors; it is recruited to the nucleus in the presence of 
ligand-bound androgen receptor (AR) (Yang, F., et al., J. Biol Chem, 276:15345-15353, 
2001) and it is able to bind Era (Lee, S. R., et al., Mol Endocrinol 16:85-99, 2002). 
Binding of PAK6 to AR appears to repress transcription by preventing coactivator 
binding. This may be a novel mechanism within the PAK family and has implications for 
the treatment of prostate and breast cancer. Unphosphorylated (active) PAK6 has recently 
been shown to be differentially expressed in prostate cancer cell lines (Schrantz, N., et al., 
J. Biol Chem y 279:1922-1931, 2004). 

[0003] The present invention describes the 3-dimensional structure of the kinase 
domain of PAK6 (PAK6KD). The 3 dimensional structure of PAK6 may be useful, for 
example, for identifying novel therapeutic compounds that can modulate protein kinase 
activity, and for treatment of diseases or conditions mediated by human signal 
transduction kinase activity, or diseases or conditions that may be alleviated or prevented 
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by modulation of, PAK6 activity, and/or, for example, actin reorganization, gene 
transcription, apoptosis, hormone signaling, or cell transformation. 
[0004] Knowledge of the 3-D structures of target proteins provides an important basis 
for structure-based approaches to drug design by defining the topographies of the 
complementary surfaces of ligands and their protein targets. Therefore, knowledge of the 
structure of the PAK6KD protein described in the present invention may be useful in the 
identification, design, or development of novel and specific modulators of protein kinase 
activity as well as diagnostic and pharmaceutical compounds useful for disorders 
associated with aberrant PAK6 expression or activity. Knowledge of the structure may 
also be useful for gene therapy. The structural coordinates may be used, for example, to 
engineer more stable or other modified PAK6s. The ability to obtain the molecular 
structure coordinates of PAK6KD has not previously been realized. 
[0005] Citation of documents herein is not intended as an admission that any is 
pertinent prior art. All statements as to the date or representation as to the contents of 
documents is based on the information available to the applicant and does not constitute 
any admission as to the correctness of the dates or contents of the documents. 

SUMMARY OF THE INVENTION 

[0006] The present invention provides crystalline PAK6KD, its molecular structure in 
atomic detail, homologs and mutants of the structure, methods of using the structure to 
identify and design compounds that modulate the activity of PAK6, methods of preparing 
identified and/or designed compounds, methods of affecting cell growth and/or viability, 
and thus treating diseases or conditions, by modulating PAK6 activity, and methods of 
identifying and designing mutant PAK6s. Knowledge of the structure of PAK6KD may 
be useful in the development of novel compounds regulating, for example, cell 
proliferation, cell migration, differentiation, cytoskeletal organization, gene expression, 
cell cycle progression, and cell death. Knowledge of the structure of PAK6KD may also 
be used to model the structure of kinases with related ligand binding sites, such as, for 
example, other PAKs. 

[0007] By "PAK6 activity" is meant PAK6 kinase activity, binding activity, 

imunogenicity, or any enzymatic activity of the PAK6 protein, or the PAK6 kinase domain 

alone. Thus, PAK6 activity may be assayed, where appropriate, using all or a portion of 

the entire PAK6 molecule. For example, the PAK6 kinase domain alone may be used in 
Express Mai! No. EV3 151 35696US 2 
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kinase, binding, immunogenicity, or other PAK6 enzymatic activities. Similarly, a 
modulator, inhibitor, or activator of PAK6 protein may also be a modulator, inhibitor, or 
activator of the PAK6 kinase domain, and modulation, inhibition or activation of PAK6 
activity may be assayed by assaying the modulation, inhibition, or activation of PAK6 
kinase domain activity. Also, where PAK6KD activity is assayed, portions of the PAK6 
molecule in addition to the PAK6KD may be used in the assay. Thus, for example, where 
the present invention describes assaying modulation, inhibition, or activation of PAK6KD, 
instead, an assay may be performed to determine modulation, inhibition, or activation of 
PAK6. 

[0008] Thus, in one aspect, the invention provides purified PAK6KD, and methods of 
purifying PAK6KD. PAK6KD may be sufficiently pure such that it may be used to 
prepare diffraction quality crystals. For ease of obtaining diffraction quality crystals, the 
purified PAK6KD may be predominantly, or entirely, of one phosphorylation state. 
[0009] Thus, in one aspect, the invention provides a crystal comprising PAK6 or 
PAK6KD peptides in preferrred crystalline form. In some embodiments of the invention 
the crystal is diffraction quality. The crystals of the invention include, for example, 
crystals of wild type PAK6KD, crystals of mutated PAK6KD, native crystals, heavy-atom 
derivative crystals, and crystals of PAK6KD homologs or PAK6KD mutants, such as, but 
not limited to, selenomethionine or selenocysteine mutants, mutants comprising 
conservative alterations in amino acid residues, and truncated or extended mutants. 
[0010] The crystals of the invention also include co-crystals, in* which crystallized 
PAK6KD is in association with one or more compounds, including but not limited to, 
cofactors, ligands, substrates, substrate analogs, inhibitors, activators, agonists, 
antagonists, modulators, allosteric effectors, etc., to form a crystalline co-complex. Such 
compounds may or may not bind a catalytic or active site of PAK6KD within the crystal. 
Alternatively, such compounds stably interact with another binding pocket of PAK6KD 
within the crystal. The co-crystals may be native co-crystals, in which the co-complex is 
substantially pure, or they may be heavy-atom derivative co-crystals, in which the co- 
complex is in association with one or more heavy-metal atoms, preferably heavy-metal 
atoms that promote anomalous scattering. 

[0011] In other embodiments, the crystals of the invention are of sufficient quality to 

permit the determination of the three-dimensional X-ray diffraction structure of the 

crystalline polypeptide to high resolution, for example, to a resolution of better than 3A, 
Express Mail No. EV3 1 5 1 35696US 3 
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or, at least lA and up to about 3 A, and more typically a resolution of greater than 1.5 A 
and up to 2A or about 2A, or 2.5A or about 2.5 A. 

[0012] In some embodiments, the crystals are characterized by a unit cell of a= 
59.2A+/- 2%, b= 66.7A +/- 2%, c= 97.2A+/- 2%, o^= 90°, (3= 90° , y= 90°, and a space 
group of P 21 21 21. 

[0013] The invention also provides methods of making the crystals of the invention. 
Generally, crystals of the invention are grown by dissolving substantially pure polypeptide 
in an aqueous buffer that includes a precipitant at a concentration just below that necessary 
to precipitate the polypeptide. Water is then removed by controlled evaporation to 
produce precipitating conditions, which are maintained until the crystal forms and the size 
of the crystal is appropriate. 

[0014] Co-crystals of the invention are prepared by soaking a native crystal prepared 
according to the above method in a liquor comprising the compound of the desired co- 
complex. Alternatively, the co-crystals may be prepared by co-crystallizing the 
polypeptide in the presence of the compound according to the method discussed above. 
[0015] Heavy-atom derivative crystals of the invention may be prepared by soaking 
native crystals or co-crystals prepared according to the above method in a liquor 
comprising a salt of a heavy atom or an organometallic compound. Alternatively, heavy- 
atom derivative crystals may be prepared by crystallizing a polypeptide comprising 
modified amino acids, for example, selenomethionine and/or selenocysteine residues 
according to the methods described above for preparing native crystals. 
[0016] In yet another embodiment of the present invention, a method is provided for 
determining the three-dimensional structure of a PAK6KD crystal, comprising the steps of 
providing a crystal of the present invention; and analyzing the crystal by x-ray diffraction 
to determine the three-dimensional structure. Stated differently, the invention provides for 
the production of three-dimensional structural information (or "data") from the crystals of 
the invention. Such information may be in the form of structural coordinates that define 
the three-dimensional structure of PAK6KD in a crystal and/or co-crystal. Alternatively, 
the structural coordinates may define the three-dimensional structure of a portion of 
PAK6KD in the crystal. Non-limiting examples of portions of PAK6KD include the 
catalytic or active site, and a binding pocket. The structural coordinate information may 
include other structural information, such as vector representations of the molecular 
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structures coordinates, and be stored or compiled in the form of a database, optionally in 
electronic form. 

[0017] The invention thus provides methods of producing a computer readable 
database comprising the three-dimensional molecular structural coordinates of binding 
pocket of PAK6KD, said methods comprising obtaining three-dimensional structural 
coordinates defining PAK6KD or a binding pocket of PAK6KD, from a crystal of 
PAK6KD; and introducing said structural coordinates into a computer to produce a 
database containing the molecular structural coordinates of PAK6KD or said binding 
pocket. The invention also provides databases produced by such methods. 
[0018] In an alternative embodiment, the invention provides for the use of identifiers of 
structural information to be all or part of the information defining the three-dimensional 
structure of PAK6KD so that all or part of the actual structural information need not be 
present. For example, and without limiting the invention, identifiers which reference 
structural coordinates defining a three-dimensional structure, substructure or shape may be 
used in place of the actual coordinate information. Such reference structural information 
is optionally stored separately from the identifiers used to define the three-dimensional 
structure of PAK6KD. A non-limiting example is the use of an identifier for an alpha 
helix structure in place of the coordinates of the helical structure, or the use of distances 
and angles to represent the structure. 

[0019] In another aspect, the invention provides computer machine-readable media 
embedded with the three-dimensional structural information obtained from the crystals of 
the invention, or portions or substrates thereof. The invention also provides methods for 
the introduction of the structural information into a computer readable medium, optionally 
as a computer readable database. The types of machine- or computer-readable media into 
which the structural information is embedded typically include magnetic tape, floppy 
discs, hard disc storage media, optical discs, CD-ROM, electrical storage media such as 
RAM or ROM, and hybrids of any of these storage media. Such media further include 
paper that may be read by a scanning device and converted into a three-dimensional 
structure with, for example, optical character recognition (OCR) software. In one 
example, the sheet of paper presents the molecular structure coordinates of crystalline 
polypeptide of the invention that are converted into, for example, a spread sheet by OCR 
software. The machine-readable media of the invention may further comprise additional 
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information that is useful for representing the three-dimensional structure, including, but 
not limited to, thermal parameters, chain identifiers, and connectivity information. 
[0020] Various machine-readable media are provided in the present invention. In one 
aspect, a machine-readable medium is provided that is embedded with information 
defining a three-dimensional structural representation of any of the crystals of the present 
invention, or a fragment or portion thereof. The information may be in the form of 
molecular structure coordinates, such as, for example, those of Fig. 4. Alternatively, the 
information may include an identifier used to reference a particular three dimensional 
structure, substructure or shape. The machine-readable medium may be embedded with 
the molecular structure coordinates of a protein molecule comprising a PAK6KD active 
site, active site homolog, binding pocket or binding pocket homolog. The various 
machine-readable media of the present invention may also comprise data corresponding to 
a molecule comprising a PAK6KD binding pocket or binding pocket homolog in 
association with a compound or molecule bound to the protein, such as in a co-crystal. 
[0021] The molecular structure coordinates and machine-readable media of the 
invention have a variety of uses. For example, the coordinates are useful for solving the 
three-dimensional X-ray diffraction and/or solution structures of other proteins, including 
mutant PAK6KD, co-complexes comprising PAK6KD, and unrelated proteins, to high 
resolution. Structural information may also be used in a variety of molecular modeling 
and computer-based screening applications to, for example, intelligently design mutants of 
the crystallized PAK6KD that have altered biological activity and to computationally 
design and identify compounds that bind the polypeptide or a portion or fragment of the 
polypeptide, such as a subunit, a domain or an active site. Such compounds may be used 
directly or as lead compounds in pharmaceutical efforts to identify compounds that affect 
PAK6KD activity. Compounds that bind to the polypeptide, or to a portion or fragment 
thereof may be used as, for example, antimicrobial agents. 

[0022] The invention thus provides methods of producing a computer readable 
database comprising a representation of a compound capable of binding a binding pocket 
of PAK6KD, said methods comprising introducing into a computer program a computer 
readable database comprising structural coordinates which may be used to produce a three 
dimensional representation of PAK6KD, generating a three-dimensional representation of 
a binding pocket of PAK6KD in said computer program, superimposing a three- 
dimensional model of at least one binding test compound on said representation of the 
Express Mail No. EV315135696US 6 
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binding pocket, assessing whether said test compound model fits spatially into the binding 
pocket of PAK6KD and storing a representation of a compound that fits into the binding 
pocket into a computer readable database. The database used to store the representation of 
a compound may be the same or different from that used to store the structural coordinates 
of PAK6KD. The invention further provides for the electronic transmission of any 
structural information resulting from the practice of the invention, such as by telephonic, 
computer implemented, microwave mediated, and satellite mediated means as non- 
limiting examples. 

[0023] As described above, the molecular structure coordinates and/or machine- 
readable media associated with PAK6KD structure may also be used in the production of 
three-dimensional structural information (or "data") of a compound capable of binding 
PAK6KD. Such information may be in the form of structural coordinates that define the 
three-dimensional structure of a compound, optionally in combination or with reference to 
structural components of PAK6KD. In some embodiments, the structure coordinates of 
the compound are determined and presented (or represented) relative to the structure 
coordinates of the protein. Alternatively, identifiers of structural information are used to 
represent all or part of the information defining the three-dimensional structure of a 
compound so that all or part of the actual structural information need not be present. For 
example, and without limiting the invention, if the structural information of a compound 
includes a region defining a pyrophosphate (or pyrophosphate mimetic) moiety, the 
structural coordinates of pyrophosphate may be substituted by an identifier representing 
the structure of pyrophosphate, such as the name, chemical formula or other chemical 
representation. Any compound capable of binding PAK6KD may be represented by 
chemical name, chemical or molecular formula, chemical structure, and/or other 
identifying information. As a non-limiting example, the compound CH 3 CH 2 OH may be 
represented by names such as ethanoi or ethyl alcohol, abbreviations such as EtOH, 
chemical or molecular formulas such as CH 3 CH 2 OH or C2H5OH or C2H6O, and/or by 
structural representations in two or three dimensions. Non-limiting examples of the latter 
include Fisher projections, electron density maps and representations, space filling models, 
and the following: 
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(C C-^OH 



CH 3 CH 2 -OH 



H 3 C C OH 

H 2 




[0024] Non-limiting examples of other identifying information include Chemical 
Abstract Service (CAS) Registry numbers and physical or chemical properties indicative 
of the compound (such as, but not limited to, NMR spectra, IR spectra, MS spectra, GC 
profiles, and melting point). Of course the structures of a portion of a compound (e.g. a 
substructure) may be similarly identified by reference to any of the above used to identify 
a compound as a whole. 

[0025] To produce structural information of a compound capable of binding PAK6KD, 
the invention provides for the use of a variety of methods, including a) the superimposition 
of structures of known compounds on the structure of PAK6KD or a portion thereof, b) 
the determination of a "pharmacophore" structure which binds PAK6KD, and c) the 
determination of substructure(s) of compounds, wherein the substructure(s) interact with 
PAK6KD. The structural coordinate information may include other structural 
information, such as vector representations of the molecular structures coordinates, and be 
stored or compiled in the form of a database, optionally in electronic form. With respect 
to a), the invention includes the computational screening of a three-dimensional structural 
representation of PAK6KD or a portion thereof, or a molecule comprising a PAK6KD 
binding pocket or binding pocket homolog, with a plurality of chemical compounds and 
chemical entities. Alternatively, the present invention provides a method of identifying at 
least one compound that potentially binds to PAK6KD, comprising, constructing a three- 
Express Mail No. EV3 15 135696US 8 
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dimensional structure of a protein molecule comprising a PAK6KD binding pocket or 
binding pocket homolog, or constructing a three-dimensional structure of a molecule 
comprising a PAK6KD binding pocket, and computationally screening a plurality of 
compounds using the constructed structure, and identifying at least one compound that 
computationally binds to the structure. In one aspect, the method further comprises 
determining whether the compound binds PAK6KD. 

[0026] With respect to b) the invention includes the computational screening of a 
plurality of chemical compounds to determine which compound(s), or portion(s) thereof, 
fit a pharmacophore determined as fitting within a PAK6KD binding pocket. Stated 
differently, the structures of chemical compounds may be screened to identify which 
compound(s), or portion(s) thereof, is encompassed by the parameters of an identified 
pharmacophore. As used herein, "pharmacophore" refers to the structural characteristics 
determined as necessary for a chemical moiety to fit or bind a PAK6KD binding pocket. 
A non-limiting example of a pharmacophore is a description of the electronic 
characteristics necessary for interaction with a binding site. These characteristics may be 
representations of the ground and excited state wave functions of a pharmacophore, 
including specification of known expansions of such functions. Representations of a 
pharmacophore contain the chemical moieties, and/or atoms thereof, within the 
pharmacophore as well as their electronic characteristics and their three dimensional 
arrangement in space. Other representations may also be used because different chemical 
moieties may have similar characteristics. A non-limiting example is seen in the case of a 
-SH moiety at a particular position, which has similar characteristics to a -OH moiety at 
the same position. Chemical moieties that may be substituted for each other within a 
pharmacophore are referred to as "homologous". 

[0027] The present invention thus provides methods for producing a computer readable 

database comprising a representation of a compound capable of binding a binding pocket 

of PAK6KD, said methods comprising introducing into a computer program a computer 

readable database comprising structural coordinates which may be used to produce a three 

dimensional representation of PAK6KD, determining a pharmacophore that fits within 

said binding pocket, computationally screening a plurality of compounds to determine 

which compound(s) or portion(s) thereof fit said pharmacophore, and storing a 

representation of said compound(s) or portion(s) thereof into a computer readable 

database. The database may be the same or different from that used to store the structural 
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coordinates of PAK6KD. Determination of a pharmacophore that fits may be performed 
by any means known in the art. 

[0028] With respect to c) the invention includes the computational screening of a 
plurality of chemical compounds to determine which compounds comprise a substructure 
that interacts with PAK6KD. The invention thus provides methods of producing a 
computer readable database comprising a representation of a compound capable of binding 
a binding pocket of PAK6KD, said methods comprising introducing into a computer 
program a computer readable database comprising structural coordinates which may be 
used to produce a three dimensional representation of PAK6KD, determining a chemical 
moiety that interacts with said binding pocket, computationally screening a plurality of 
compounds to determine which compound(s) comprise said moiety as a substructure of 
said compound(s), and storing a representation of said compound(s) and/or said moiety 
into a computer readable database which may be the same or different from that used to 
store the structural coordinates of PAK6KD. 

[0029] In one embodiment of the invention, the particulars of which may be used in 
combination with the other embodiments of the invention, a method is provided for 
producing structural information of a compound capable of binding PAK6KD by selecting 
at least one compound that potentially binds to PAK6KD. The method comprises 
constructing a three-dimensional structure of PAK6KD having structure coordinates 
selected from the group consisting of the structure coordinates of the crystals of the 
present invention, the structure coordinates of Fig, 4, and the structure coordinates of a 
protein having a root mean square deviation of the alpha carbon atoms of up to about 
1.5 A, preferably up to about 1.25 A, preferably up to about lA, preferably up to about 
0.75 A, preferably up to about 0.5A, and preferably up to about 0.25A, when compared to 
the structure coordinates of Fig. 4, or a portion thereof, or constructing a three- 
dimensional structure of a molecule comprising a PAK6KD binding pocket or binding 
pocket homolog; and selecting at least one compound which potentially binds PAK6KD; 
wherein the selecting is performed with the aid of the constructed structure of PAK6KD. 
[0030] It is anticipated that in some cases, upon binding a compound, the conformation 
of the protein may be altered. Useful compounds may bind to this altered conformational 
form. Thus, included within the scope of the present invention are methods of producing 
structural information of a compound capable of binding PAK6KD by selecting 

compounds that potentially bind to a PAK6KD molecule or homolog where the molecule 
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or homolog comprises an amino acid sequence that is at least 50%, preferably at least 
60%, more preferably at least 70%, more preferably at least 80%, and more preferably at 
least 90% identical to the amino acid sequence of Fig. 2, using, for example, a PSI BLAST 
search, such as, but not limited to version 2.2.2 (Altschul, S.F., et al., Nuc. Acids Rec. 
25:3389-3402, 1997). Preferably at least 50%, more preferably at least 70% of the 
sequence is aligned in this analysis and where at least 50%, more preferably 60%, more 
preferably 70%, more preferably 80%, and most preferably 90% of the amino acids of the 
molecule or homolog have structure coordinates selected from the group consisting of the 
structure coordinates of the crystals of the present invention, the structure coordinates of 
Fig. 4, and the structure coordinates of a protein having a root mean square 
deviation of the alpha carbon atoms of up to about 1 .5 A, preferably up to about 1 .25 A, 
preferably up to about 1 A, preferably up to about 0.75A, preferably up to about 0.5A, and 
preferably up to about 0.25 A, when compared to the structure coordinates of Fig. 4, or a 
portion thereof, or constructing a three dimensional structure of a molecule comprising a 
PAK6 binding pocket or binding pocket homolog; and selecting at least one compound 
which potentially binds PAK6; wherein the selecting is performed with the aid of the 
constructed structure. The selected compounds thus provide information concerning the 
structure of compounds that bind PAK6. 

[0031] Once produced, structural information of a compound capable of binding PAK6 
may be stored in machine-readable form as described above for PAK6 structural 
information. 

[0032] In yet another aspect of the present invention, a method is provided of 

identifying a modulator of PAK6 by rational drug design, comprising; designing a 

potential modulator of PAK6 that forms covalent or non-covalent bonds with amino acids 

in a binding pocket of PAK6 based on the molecular structure coordinates of the crystals 

of the present invention, or based on the molecular structure coordinates of a molecule 

comprising a PAK6 binding pocket or binding pocket homolog; synthesizing the 

modulator; and determining whether the potential modulator affects the activity of PAK6. 

The binding pocket may, for example, comprise the active site of PAK6. The binding 

pocket may instead comprise an allosteric binding pocket of PAK6. A modulator may be, 

for example, an inhibitor, an activator, or an allosteric modulator of PAK6. 

[0033] Other methods of designing modulators of PAK6 include, for example, a 

method for identifying a modulator of PAK6 activity comprising: providing a computer 
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modeling program with a three dimensional conformation for a molecule that comprises a 
binding pocket of PAK6, or binding pocket homolog; providing a said computer modeling 
program with a set of structure coordinates of a chemical entity; using said computer 
modeling program to evaluate the potential binding or interfering interactions between the 
chemical entity and said binding pocket, or binding pocket homolog; and determining 
whether said chemical entity potentially binds to or interferes with said molecule; wherein 
binding to the molecule is indicative of potential modulation, including, for example, 
inhibition of PAK6 activity. 

[0034] In another embodiment, a method is provided for designing a modulator of 
PAK6 activity comprising: providing a computer modeling program with a set of structure 
coordinates, or a three dimensional conformation derived therefrom, for a molecule that 
comprises a binding pocket of PAK6, or binding pocket homolog; providing a said 
computer modeling program with a set of structure coordinates, or a three dimensional 
conformation derived therefrom, of a chemical entity; using said computer modeling 
program to evaluate the potential binding or interfering interactions between the chemical 
entity and said binding pocket, or binding pocket homolog; computationally modifying the 
structure coordinates or three dimensional conformation of said chemical entity; and 
determining whether said modified chemical entity potentially binds to or interferes with 
said molecule; wherein binding to the molecule is indicative of potential modulation of 
PAK6 activity. 

[0035] In other aspects, determining whether the chemical entity potentially binds to 
said molecule comprises performing a fitting operation between the chemical entity and a 
binding pocket, or binding pocket homolog, of the molecule or molecular complex; and 
computationally analyzing the results of the fitting operation to quantify the association 
between, or the interference with, the chemical entity and the binding pocket, or binding 
pocket homolog. In a further embodiment, the method further comprises screening a 
library of chemical entities. 

[0036] The PAK6 modulator may also be designed de novo. Thus, the present 

invention also provides a method for designing a modulator of PAK6, comprising: 

providing a computer modeling program with a set of structure coordinates, or a three 

dimensional conformation derived therefrom, for a molecule that comprises a binding 

pocket having the structure coordinates of the binding pocket of PAK6, or a binding 

pocket homolog; computationally building a chemical entity represented by set of 
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structure coordinates; and determining whether the chemical entity is a modulator 
expected to bind to or interfere with the molecule wherein binding to the molecule is 
indicative of potential modulation of PAK6 activity. In other embodiments, determining 
whether the chemical entity potentially binds to said molecule comprises performing a 
fitting operation between the chemical entity and a binding pocket of the molecule or 
molecular complex, or a binding pocket homolog; and computationally analyzing the 
results of the fitting operation to quantify the association between, or the interference with, 
the chemical entity and the binding pocket, or a binding pocket homolog. 
[0037] In yet other embodiments, once a modulator is computationally designed or 
identified, the potential modulator may be supplied or synthesized, then assayed to 
determine whether it inhibits PAK6 activity. The molecular structure coordinates and/or 
machine-readable media associated with the PAK6 structure and/or a compound capable 
of binding PAK6KD may be used in the production of compounds capable of binding 
PAK6. Methods for the production of such compounds include the preparation of an 
initial compound containing chemical groups most likely to bind or interact with residues 
of PAK6KD based upon the molecular structure coordinates of PAK6KD and/or a 
compound capable of binding it. Such an initial compound may also be viewed as a 
scaffold comprising one or more reactive moieties (chemical groups) that are capable of 
binding or interacting with PAK6 residues. The initial compound may be further 
optimized for binding to PAK6 by introduction of additional chemical groups for 
increased interactions with PAK6KD residues. An initial compound may thus comprise 
reactive groups which may be used to introduce one or more additional chemical groups 
into the compound. The introduction of additional groups may also be at positions of an 
initial compound that do not result in interactions with PAK6 residues, but rather improve 
other characteristics of the compound, such as, but not limited to, stability against 
degradation, handling or storage, solubility in hydrophilic and hydrophobic environments, 
and overall charge dynamics of the compound. 

[0038] The present invention also provides modulators of PAK6 activity identified, 

designed, or made according to any of the methods of the present invention, as well as 

pharmaceutical compositions comprising such modulators. Pharmaceutical compositions 

may be in the form of a salt, and may further comprise a pharmaceutical^ acceptable 

carrier. A modulator may be identified or confirmed as an activator or inhibitor by 

contacting a protein that comprises a PAK6 active site or binding pocket with said 
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modulator and determining whether it activates or inhibits the activity of the protein. The 
activity may be PAK6 activity. A naturally occurring PAK6 protein may also be used in 
such methods. 

[0039] Also provided in the present invention is a method of modulating PAK6 activity 
comprising contacting PAK6 with a modulator designed or identified according to the 
present invention. Methods include methods of treating a disease or condition associated 
with inappropriate PAK6 activity comprising the method of administering by, for 
example, contacting cells of an individual with a PAK6 modulator designed or identified 
according to the present invention. The term "inappropriate activity" refers to PAK6 
activity that is higher or lower than that in normal cells. 

[0040] The molecular structure coordinates and/or machine-readable media of the 
invention may also be used in identification of active sites and binding pockets of 
PAK6KD. Methods for the identification of such sites and pockets are known in the art. 
The techniques include the use of sequence comparisons to identify regions of homology 
or conserved substitutions which define conserved structure among different forms of 
PAK6KD. The techniques may also include comparisons of structure with other proteins 
with the same activities as PAK6 to identify the structural components (e.g. amino acid 
residues and/or their arrangement in three dimensions) of the active sites and binding 
pockets. 

[0041] In another embodiment of the present invention, a method is provided for 

producing a mutant of PAK6, having an altered property relative to PAK6, comprising, a) 

constructing a three-dimensional structure of PAK6KD having structure coordinates 

selected from the group consisting of the structure coordinates of the crystals of the 

present invention, the structure coordinates of Fig. 4, and the structure coordinates of a 

protein having a root mean square deviation of the alpha carbon atoms of the protein of up 

to about 1.5 A, preferably up to about 1.25A, preferably up to about lA, preferably up to 

about 0.75A, preferably up to about 0.5A, and preferably up to about 0.25A, when 

compared to the structure coordinates of Fig. 4; b) using modeling methods to identify in 

the three-dimensional structure at least one structural part of the PAK6KD molecule 

wherein an alteration in the structural part is predicted to result in the altered property; c) 

providing a nucleic acid molecule having a modified sequence that encodes a deletion, 

insertion, or substitution of one or more amino acids at a position corresponding to the 

structural part; and d) expressing the nucleic acid molecule to produce the mutant; wherein 
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the mutant has at least one altered property relative to the parent The mutant may, for 
example, have altered PAK6 activity. The altered PAK6 activity may be, for example, 
altered binding activity, altered enzymatic activity, and altered immunogenicity, such as, 
for example, where an epitope of the protein is altered because of the mutation. The 
mutation that alters the epitope may be, for example, within the region of the protein that 
comprises the epitope. Or, the mutation may be, for example, at a site outside of the 
epitope region, yet causes a conformational change in the epitope region. Those of 
ordinary skill in the art will recognize that the region that contains the epitope may 
comprise either contiguous or non-contiguous amino acids. 

[0042] Also provided in the present invention is a method for obtaining structural 

information about a molecule or a molecular complex of unknown structure comprising: 

crystallizing the molecule or molecular complex; generating an x-ray diffraction pattern 

from the crystallized molecule or molecular complex; and using a molecular replacement 

method to interpret the structure of said molecule; wherein said molecular replacement 

method uses the structure coordinates of Fig. 4, or structure coordinates having a root 

mean square deviation for the alpha-carbon atoms of said structure coordinates of up to 

about 2.0A, preferably up to about 1.75 A, preferably up to about 1.5 A, preferably up to 

about 1.25A, preferably up to about l.OA, preferably up to about 0.75 A, the structure 

coordinates of the binding pocket of Fig. 4, or a binding pocket homolog. The coordinates 

of the resulting structure are stored in a computer readable database as described herein. 

[0043] In another aspect of the invention, a method is provided of using the PAK6KD 

structure coordinates, or the PAK6KD binding site, active site, or accessory binding site 

structure coordinates as an anti-target in rational drug design. When designing compounds 

that modulate a protein target's activity, it is often desirable to increase specificity for the 

target and reduce side effects. The protein structure information is useful to design 

compounds that do not bind to, interact with, or modulate the activity of the protein. Thus, 

one aspect of the present invention comprises the use of anti-target structures to assist in 

selecting a compound that modulates the target, but does not modulate PAK6, or does not 

modulate PAK6 in sufficient amount to cause a detrimental side affect. The target may, 

for example, be another kinase. The target may be another PAK kinase. 

[0044] Thus, in one aspect of the invention, a method is provided of identifying a 

compound that modulates the activity of a target protein, comprising: a) introducing into a 

computer program information derived from structural coordinates defining an active site 
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conformation of a target protein molecule based upon three-dimensional structure 
determination, wherein said program utilizes or displays the three-dimensional structure 
thereof; b) generating a three-dimensional representation of the active site cavity of said 
target protein in said computer program; c) superimposing a model of a test compound on 
the model of said active site of said target protein; d) assessing whether said test 
compound model fits spatially into the active site of said target protein; e) generating a 
three-dimensional representation of a binding pocket of a PAK6KD protein in a computer 
program; f) superimposing a model of said test compound on the model of said binding 
pocket of said PAK6KD protein; and g) assessing whether said test compound model fits 
spatially into said binding pocket of said PAK6KD protein. 

[0045] The binding pocket of the PAK6KD protein may be, for example, an active site 
or an accessory binding site. Said target protein may be a kinase. The test compound 
model may or may not fit spatially into the binding pocket of said PAK6KD protein. The 
method may further comprise performing a fitting operation to computationally analyze 
the association between the test compound and the PAK6KD protein. The test compound 
may bind with greater efficiency to the target protein than to the PAK6KD protein; the test 
compound likely does not bind to the PAK6KD protein. 

[0046] In yet another aspect of the invention, a method is provided for homology 

modeling of a PAK6KD homolog comprising: aligning the amino acid sequence of a 

PAK6KD homolog with an amino acid sequence of PAK6KD; incorporating the sequence 

of the PAK6KD homolog into a model of the structure of PAK6KD, wherein said model 

has the same structure coordinates as the structure coordinates of Fig. 4, or wherein the 

structure coordinates of said model's alpha-carbon atoms have a root mean square 

deviation from the structure coordinates of Fig. 4 of up to about 2.0A, preferably up to 

about 1.75 A, preferably up to about 1.5 A, preferably up to about 1.25 A, preferably up to 

about LOA, and preferably up to about 0.75 A, to yield a preliminary model of said 

homolog; subjecting the preliminary model to energy minimization to yield an energy 

minimized model; and remodeling regions of the energy minimized model where 

stereochemistry restraints are violated to yield a final model of said homolog. 

[0047] The invention also provides PAK6KD in crystalline form, as well as a computer 

or machine readable medium containing information that reflects the three dimensional 

structure of such crystals and/or compounds that interact with them. Also provided is a 

method of producing a computer readable database containing the three-dimensional 
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molecular structure coordinates of a compound capable of binding the active site or 
binding pocket of a PAK6KD but not another protein molecule. Such a method comprises 

a) introducing into a computer program information concerning the structure of PAK6KD; 

b) generating a three-dimensional representation of the active site or binding pocket of 
PAK6KD in said computer program; c) superimposing a three-dimensional model of at 
least one binding test compound on said representation of the active site or binding pocket; 
d) assessing whether said test compound model fits spatially into the active site or binding 
pocket of PAK6KD; e) assessing whether a compound that fits will fit a three-dimensional 
model of another protein, the structural coordinates of which are also introduced into said 
computer program and used to generate a three-dimensional representation of the other 
protein; and f) storing the three-dimensional molecular structure coordinates of a model 
that does not fit the other protein into a computer readable database. An alternative form 
of such a method produces a computer readable database containing the three-dimensional 
molecular structural coordinates of a compound capable of specifically binding the active 
site or binding pocket of PAK6KD, said method comprising introducing into a computer 
program a computer readable database containing the structural coordinates of PAK6KD, 
generating a three-dimensional representation of the active site or binding pocket of 
PAK6KD in said computer program, superimposing a three-dimensional model of at least 
one binding test compound on said representation of the active site or binding pocket, 
assessing whether said test compound model fits spatially into the active site or binding 
pocket of PAK6KD, assessing whether a compound that fits will fit a three-dimensional 
model of another protein, the structural coordinates of which are also introduced into said 
computer program and used to generate a three-dimensional representation of the other 
protein, and storing the three-dimensional molecular structural coordinates of a model that 
does not fit the other protein into a computer readable database. Conversely, such 
methods may be used to determine that compounds identified as binding other proteins do 
not bind PAK6KD. Thus, such methods may use PAK6KD as an anti-target, to identify 
compounds that do not bind PAK6KD. 

[0048] The invention also provides methods comprising the production of a co-crystal 

of a compound and PAK6KD. Such co-crystals may be used in a variety of ways, 

including the determination of structural coordinates of the compound and/or PAK6KD, or 

a binding pocket thereof, in the co-crystal. Such coordinates may be introduced and/or 

stored in a computer readable database in accordance with the present invention for further 
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use. The invention thus provides methods of producing a computer readable database 
comprising a representation of a binding pocket of PAK6KD in a co-crystal with a 
compound, said methods comprising preparing a binding test compound represented in a 
computer readable database produced by any method described herein, forming a co- 
crystal of said compound with a protein comprising a binding pocket of PAK6KD, 
obtaining the structural coordinates of said binding pocket in said co-crystal, and 
introducing the structural coordinates of said binding pocket or said co-crystal into a 
computer-readable database. The invention further provides for a combination of such 
methods with rational compound design by providing methods of producing a computer 
readable database comprising a representation of a binding pocket of PAK6KD in a co- 
crystal with a compound rationally designed to be capable of binding said binding pocket, 
said methods comprising preparing a binding test compound represented in a computer 
readable database produced by any method described herein, forming a co-crystal of said 
compound with a protein comprising a binding pocket of PAK6KD, obtaining the 
structural coordinates of said binding pocket in said co-crystal, and introducing the 
structural coordinates of said binding pocket or said co-crystal into a computer-readable 
database. 

[0049] The invention is illustrated by way of the present application, including working 
examples demonstrating the purification and the crystallization of PAK6KD, the 
characterization of crystals, the collection of diffraction data, and the determination and 
analysis of the three-dimensional structure of PAK6KD. 

BRIEF DESCRIPTION OF THE FIGURES 

[0050] FIG. 1 provides a ribbon diagram of the structure of PAK6KD. 

[0051] FIG. 2 provides the predicted amino acid sequence of the PAK6KD expressed 

protein used to obtain the crystals and structural coordinates of the present invention. 

Note that this amino acid sequence may comprise amino acids encoded by the ORF, as 

well as other amino acids encoded by the expression vector. Further information 

regarding sequence changes, if any, may be found in the examples. 

[0052] FIG. 3 provides a ribbon diagram of the structure of PAK6KD. 

[0053] FIG. 4 (A-RR) provides the molecular structure coordinates of PAK6KD. 

[0054] The following abbreviations are used in Figure 4. 
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[0055] " Atom Type " and " Atom " refer to the individual atom whose coordinates are 
provided, with and without indicating the position of the atom in the amino acid residue, 
respectively. The first letter in the column refers to the element. 

[0056] HETATM refers to atomic coordinates within non-standard HET groups, such 
as prosthetic groups, inhibitors, solvent molecules, and ions for which coordinates are 
supplied. HET ATMS include residues that are a) not one of the standard amino acids, 
including, for example, SeMet and SeCys, b) not one of the nucleic acids (C, G, A, T, U, 
and I), c) not one of the modified versions of nucleic acids (+C, +G, +A, +T, +U, and +1), 
and d) not an unknown amino acid or nucleic acid where UNK is used to indicate the 
unknown residue name. 

[0057] " Residue " refers to the amino acid residue. 

[0058] refers to the residue number, starting from the N-terminal amino acid. The 

number designations of each amino acid residues reflect the position predicted in the 

expressed protein, including the His tag and the initial methionine. 

[0059] " X. Y and Z " provide the Cartesian coordinates of the atom. 

[0060] "B" is a thermal factor that measures movement of the atom around its atomic 

center. 

[0061] " OCC " refers to occupancy, and represents the percentage of time the atom type 
occupies the particular coordinate. OCC values range from 0 to 1, with 1 being 100%. 
[0062] Structure coordinates for PAK6KD according to Figure 4 may be modified by 
mathematical manipulation. Such manipulations include, but are not limited to, 
crystal lographic permutations of the raw structure coordinates, fractionalization of the raw 
structure coordinates, integer additions or subtractions to sets of the raw structure 
coordinates, inversion of the raw structure coordinates, and any combination of the above. 

Abbreviations 

[0063] The amino acid notations used herein for the twenty genetically encoded amino 
acids are: 



Amino Acid 



One-Letter 
Symbol 



Three-Letter 
Symbol 



Alanine 



A 



Ala 



Arginine 



R 



Arg 
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Asparagine 


N 


Asn 


Aspartic acid 


D 


Asp 


Cysteine 


C 


Cys 


Glutamine 


Q 


Gin 


Glutamic acid 


E 


Glu 


Glycine 


G 


Gly 


Histidine 


H 


His 


Isoleucine 


I 


He 


Leucine 


L 


Leu 


Lysine 


K 


Lys 


Methionine 


M 


Met 


Phenylalanine 


F 


Phe 


Proline 


P 


Pro 


Serine 


S 


Ser 


Threonine 


T 


Thr 


Tryptophan 


W 


Trp 


Tyrosine 


Y 


Tyr 


Valine 


V 


Val 



[0064] As used herein, unless specifically delineated otherwise, the three-letter amino 
acid abbreviations designate amino acids in the L-configuration. Amino acids in the D- 
configuration are preceded with a "D-." For example, Arg designates L-arginine and D- 
Arg designates D-arginine. Likewise, the capital one-letter abbreviations refer to amino 
acids in the L-configuration. Lower-case one-letter abbreviations designate amino acids in 
the D-configuration. For example, "R" designates L-arginine and "r" designates D- 
arginine. 

[0065] Unless noted otherwise, when polypeptide sequences are presented as a series of 
one-letter and/or three-letter abbreviations, the sequences are presented in the N— »C 
direction, in accordance with common practice. 

Definitions 

[0066] As used herein, the following terms shall have the following meanings: 
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[0067] " Genetically Encoded Amino Acid " refers to the twenty amino acids that are 
defined by genetic codons. The genetically encoded amino acids are glycine and the L- 
isomers of alanine, valine, leucine, isoleucine, serine, methionine, threonine, 
phenylalanine, tyrosine, tryptophan, cysteine, proline, histidine, aspartic acid, asparagine, 
glutamic acid, glutamine, arginine and lysine. 

[0068] " Non-Geneticallv Encoded Amino Acid " refers to amino acids that are not 
defined by genetic codons. Non-genetically encoded amino acids include derivatives or 
analogs of the genetically-encoded amino acids that are capable of being enzymatically 
incorporated into nascent polypeptides using conventional expression systems, such as 
selenomethionine (SeMet) and selenocysteine (SeCys); isomers of the genetically-encoded 
amino acids that are not capable of being enzymatically incorporated into nascent 
polypeptides using conventional expression systems, such as D-isomers of the genetically- 
encoded amino acids; L- and D-isomers of naturally occurring a-amino acids that are not 
defined by genetic codons, such as a-aminoisobutyric acid (Aib); L- and D-isomers of 
synthetic a-amino acids that are not defined by genetic codons; and other amino acids 
such as P-amino acids/y-amino acids, etc. In addition to the D-isomers of the genetically- 
encoded amino acids, common non-genetically encoded amino acids include, but are not 
limited to norleucine (Nle), penicillamine (Pen), N-me thy 1 valine (MeVal), homocysteine 
(hCys), homoserine (hSer), 2,3-diaminobutyric acid (Dab) and ornithine (Orn). Additional 
exemplary non-genetically encoded amino acids are found, for example, in Practical 
Handbook of Biochemistry and Molecular Biology, Fasman, Ed., CRC Press, Inc., Boca 
Raton, FL, pp. 3-76, 1989, and the various references cited therein. 
[0069] " Hydrophilic Amino Acid " refers to an amino acid having a side chain 
exhibiting a hydrophobicity of up to about zero according to the normalized consensus 
hydrophobicity scale of Eisenberg et al, J. Mol. Biol. 179:125-42, 1984. Genetically 
encoded hydrophilic amino acids include Thr (T), Ser (S), His (H), Glu (E), Asn (N), Gin 
(Q), Asp (D), Lys (K) and Arg (R). Non-genetically encoded hydrophilic amino acids 
include the D-isomers of the above-listed genetically-encoded amino acids, ornithine 
(Orn), 2,3-diaminobutyric acid (Dab) and homoserine (hSer). 

[0070] " Acidic Amino Acid " refers to a hydrophilic amino acid having a side chain pK 
value of up to about 7 under physiological conditions. Acidic amino acids typically have 
negatively charged side chains at physiological pH due to loss of a hydrogen ion. 
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Genetically encoded acidic amino acids include Glu (E) and Asp (D). Non-genetically 
encoded acidic amino acids include D-Glu (e) and D-Asp (d). 

[0071] " Basic Amino Acid " refers to a hydrophilic amino acid having a side chain pK 
value of greater than 7 under physiological conditions. Basic amino acids typically have 
positively charged side chains at physiological pH due to association with hydronium ion. 
Genetically encoded basic amino acids include His (H), Arg (R) and Lys (K). Non- 
genetically encoded basic amino acids include the D-isomers of the above-listed 
genetically-encoded amino acids, ornithine (Orn) and 2,3-diaminobutyric acid (Dab). 
[0072] " Polar Amino Acid " refers to a hydrophilic amino acid having a side chain that 
is uncharged at physiological pH, but which comprises at least one covalent bond in which 
the pair of electrons shared in common by two atoms is held more closely by one of the 
atoms. Genetically encoded polar amino acids include Asn (N), Gin (Q), Ser (S), and Thr 
(T). Non-genetically encoded polar amino acids include the D-isomers of the above-listed 
genetically-encoded amino acids and homoserine (hSer). 

[0073] " Hydrophobic Amino Acid " refers to an amino acid having a side chain 
exhibiting a hydrophobicity of greater than zero according to the normalized consensus 
hydrophobicity scale of Eisenberg etal, J. Mol. Biol. 179:125-42, 1984. Genetically 
encoded hydrophobic amino acids include Pro (P), He (I), Phe (F), Val (V), Leu (L), Trp 
(W), Met (M), Ala (A), Gly (G) and Tyr (Y). Non-genetically encoded hydrophobic 
amino acids include the D-isomers of the above-listed genetically-encoded amino acids, 
norleucine (Nle) and N-methyl valine (MeVal). 

[0074] "Aromatic Amino Acid" refers to a hydrophobic amino acid having a side chain 
comprising at least one aromatic or heteroaromatic ring. The aromatic or heteroaromatic 
ring may contain one or more substituents such as -OH, -SH, -CN, -F, -CI, -Br, -I, -N0 2 , 
-NO, -NH 2 , -NHR, -NRR, -C(0)R, -C(0)OH, -C(0)OR, -C(0)NH 2 , -C(0)NHR, 
-C(0)NRR and the like where each R is independently (CrQ,) alkyl, (Ci-Q) alkenyl, or 
(C|-C6> alkynyl. Genetically encoded aromatic amino acids include Phe (F), Tyr (Y), Trp 
(W) and His (H). Non-genetically encoded aromatic amino acids include the D-isomers of 
the above-listed genetically-encoded amino acids. 

[0075] "Apolar A mino Acid " refers to a hydrophobic amino acid having a side chain 

that is uncharged at physiological pH and which has bonds in which the pair of electrons 

shared in common by two atoms is generally held equally by each of the two atoms (i.e., 

the side chain is not polar). Genetically encoded apolar amino acids include Leu (L), Val 
Express Mail No. EV315135696US 22 



Docket No. PT145/01.PRO 

(V), He (I), Met (M), Gly (G) and Ala (A). Non-genetically encoded apolar amino acids 
include the D-isomers of the above-listed genetically-encoded amino acids, norleucine 
(Nle) and N-methyl valine (MeVal). 

[0076] " Aliphatic Amino Acid " refers to a hydrophobic amino acid having an aliphatic 
hydrocarbon side chain. Genetically encoded aliphatic amino acids include Ala (A), Val 
(V), Leu (L) and He (I). Non-genetically encoded aliphatic amino acids include the D- 
isomers of the above-listed genetically-encoded amino acids, norleucine (Nle) and N- 
methyl valine (MeVal). 

[0077] " Helix-Breaking Amino Acid " refers to those amino acids that have a 
propensity to disrupt the structure of a-helices when contained at internal positions within 
the helix. Amino acid residues exhibiting helix-breaking properties are well-known in the 
art {see, e.g., Chou & Fasman, Ann. Rev. Biochem. 47:251-76, 1978) and include Pro (P), 
D-Pro (p), Gly (G) and potentially all D-amino acids (when contained in an L-polypeptide; 
conversely, L-amino acids disrupt helical structure when contained in a D-polypeptide). 
[0078] " Cvsteine-like Amino Acid " refers to an amino acid having a side chain capable 
of participating in a disulfide linkage. Thus, cysteine-like amino acids generally have a 
side chain containing at least one thiol (-SH) group. Cysteine-like amino acids are unusual 
in that they can form disulfide bridges with other cysteine-like amino acids. The ability of 
Cys (C) residues and other cysteine-like amino acids to exist in a polypeptide in either the 
reduced free -SH or oxidized disulfide-bridged form affects whether they contribute net 
hydrophobic or hydrophilic character to a polypeptide. Thus, while Cys (C) exhibits a 
hydrophobicity of 0.29 according to the consensus scale of Eisenberg (Eisenberg, 1984, 
supra), it is to be understood that for purposes of the present invention Cys (C) is 
categorized as a polar hydrophilic amino acid, notwithstanding the general classifications 
defined above. Other cysteine-like amino acids are similarly categorized as polar 
hydrophilic amino acids. Typical cysteine-like residues include, for example, 
penicillamine (Pen), homocysteine (hCys), etc. 

[0079] As will be appreciated by those of skill in the art, the above-defined classes or 

categories are not mutually exclusive. Thus, amino acids having side chains exhibiting 

two or more physical-chemical properties may be included in multiple categories. For 

example, amino acid side chains having aromatic groups that are further substituted with 

polar substituents, such as Tyr (Y), may exhibit both aromatic hydrophobic properties and 

polar or hydrophilic properties, and could therefore be included in both the aromatic and 
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polar categories. Typically, amino acids will be categorized in the class or classes that 
most closely define their net physical-chemical properties. The appropriate categorization 
of any amino acid will be apparent to those of skill in the art. 

[0080] Other amino acid residues not specifically mentioned herein may be readily 
categorized based on their observed physical and chemical properties in light of the 
definitions provided herein. 

[0081] " Wild-type PAK6KD " refers to a polypeptide having an amino acid sequence 
that corresponds to the amino acid sequence of a naturally-occurring PAK6KD, and 
wherein said polypeptide, when compared to PAK6KD, has an rmsd of its backbone atoms 
of less than 2A. 

[0082] " Homo sapiens PAK6KD " refers to a polypeptide having an amino acid 
sequence that corresponds identically to the wild-type PAK6KD from Homo sapiens. 
[0083] By "or" is meant one, or another member of a group, or more than one member. 
For example, A, B, or C, may indicate any of the following: A alone; B alone; C alone; A 
and B; B and C; A and C; A, B, and C. 

[0084] " Association " refers to the status of two or more molecules that are in close 
proximity to each other. The two molecules may be associated non-covalently, for 
example, by hydrogen-bonding, van der Waals, electrostatic or hydrophobic interactions, 
or covalently. 

[0085] " Co-Complex " refers to a polypeptide in association with one or more 

compounds. The association may be, for example, covalent or non-covalent. A "PAK6KD 

co-complex" refers to PAK6KD, or a functional subunit or fragment thereof, in association 

with one or more compounds. Such compounds include, by way of example and not 

limitation, cofactors, ligands, substrates, substrate analogues, inhibitors, allosteric 

affecters, etc. Lead compounds for designing PAK6 inhibitors include, but are not 

restricted to, ATP; p-amido ATP; AMP-PNP, staurosporine, and derivatives and analogs 

thereof. A co-complex may also refer to a computer represented, or in silica generated 

association between a peptide and a compound. An "unliganded" form of a protein 

structure, or structural coordinates thereof, refers to the coordinates of the native form of a 

protein structure, or the apostructure, not a co-complex. A "liganded" form refers to the 

coordinates of a protein or peptide that is part of a co-complex. Unliganded forms include 

peptides and proteins associated with various ions, such as manganese, zinc, and 

magnesium, as well as with water. Ligands include natural substrates, non-natural 
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substrates, inhibitors, substrate analogs, agonists or antagonists, proteins, co-factors small 
molecules, test compounds, and fragments of test compounds, as well as, optionally, in 
addition, various ions or water. 

[0086] " Mutant " refers to a polypeptide characterized by an amino acid sequence that 
differs from the wild-type sequence by the substitution of at least one amino acid residue 
of the wild-type sequence with a different amino acid residue and/or by the addition and/or 
deletion of one or more amino acid residues to or from the wild-type sequence. The 
additions and/or deletions may be from an internal region of the wild-type sequence and/or 
at either or both of the N- or C-termini. A mutant polypeptide may have substantially the 
same three-dimensional structure as the corresponding wild-type polypeptide. A mutant 
may have, but need not have, PAK6 activity. A mutant may display biological activity 
that is substantially similar to that of the wild-type PAK6KD. By "substantially similar 
biological activity" is meant that the mutant displays biological activity that is within 1% 
to 10,000% of the biological activity of the wild-type polypeptide, for example, within 
25% to 5,000%, and, for example, within 50% to 500%, or 75% to 200% of the biological 
activity of the wild-type polypeptide, using assays known to those of ordinary skill in the 
art for that particular class of polypeptides. Mutants may also decrease or eliminate 
PAK6KD activity. Mutants may be synthesized according to any method known to those 
skilled in the art, including, but not limited to, those methods of expressing PAK6KD 
molecules described herein. 

[0087] "Active Site " refers to a site in PAK6KD that associates with the substrate for 
PAK6 activity. This site may include, for example, residues involved in catalysis, as well 
as residues involved in binding a substrate. Inibitors may bind to the residues of the active 
site. In PAK6KD, the active site includes one or more of the following amino acid 
residues: Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, Ser543, 
Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, Met438, 
Phe483, Asp530, Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, Ala488, 
Asp491, Gly546, Phe547, Arg675, andLys676. 

[0088] Preferably, the active site comprises Val421, Lys436, Glu452, Val465, Met481, 

Glu482, Leu484, Leu533, Ser543, and Asp544, preferably the active site further comprises 

Gly414, GIu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, Met438, Phe483, and 

Asp530. Preferably the active site further comprises Leu423, Gln443, Arg445, Leu449, 

Gln485, Gly486, Gly487, Ala488, Asp491, Gly546, Phe547, Arg675, andLys676 . 
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Amino acid residue numbers presented herein refer to the sequence of wild type full length 
PAK6. Those of ordinary skill in the art may easily determine the residue numbers 
corresponding to any modified form of PAK6 using standard alignment protocols, 
including, for example, alignments done by hand. 

[0089] " Binding Pocket " refers to a region in PAK6 which associates with a ligand 
such as a natural substrate, non-natural substrate, inhibitor, substrate analog, agonist or 
antagonist, protein, co-factor or small molecule, as well as, optionally, in addition, various 
ions or water, and/or has an internal cavity sufficient to bind a small molecule and may be 
used as a target for binding drugs. The term includes the active site but is not limited 
thereby. 

[0090] " Accessory Binding Pocket" refers to a binding pocket in PAK6KD other than 
that of the "active site." 

[0091] " Conservative Mutant " refers to a mutant in which at least one amino acid 
residue from the wild-type sequence is substituted with a different amino acid residue that 
has similar physical and chemical properties, i.e., an amino acid residue that is a member 
of the same class or category, as defined above. For example, a conservative mutant may 
be a polypeptide that differs in amino acid sequence from the wild-type sequence by the 
substitution of a specific aromatic Phe (F) residue with an aromatic Tyr (Y) or Trp (W) 
residue. 

[0092] " Non-Conservative Mutant " refers to a mutant in which at least one amino acid 
residue from the wild-type sequence is substituted with a different amino acid residue that 
has dissimilar physical and/or chemical properties, i.e., an amino acid residue that is a 
member of a different class or category, as defined above. For example, a non- 
conservative mutant may be a polypeptide that differs in amino acid sequence from the 
wild-type sequence by the substitution of an acidic Glu (E) residue with a basic Arg (R), 
Lys (K) or Orn residue. 

[0093] " Deletion Mutant " refers to a mutant having an amino acid sequence that differs 
from the wild-type sequence by the deletion of one or more amino acid residues from the 
wild-type sequence. The residues may be deleted from internal regions of the wild-type 
sequence and/or from one or both termini. 

[0094] " Truncated Mutant " refers to a deletion mutant in which the deleted residues are 
from the N- and/or C-terminus of the wild-type sequence. 



Express Mail No. EV315135696US 



26 



Docket No. PT145/01.PRO 

[0095] " Extended Mutant " refers to a mutant in which additional residues are added to 
the N- and/or C-terminus of the wild-type sequence. 

[0096] " Methionine mutant " refers to (1) a mutant in which at least one methionine 
residue of the wild-type sequence is replaced with another residue, such as with an 
aliphatic residue, such as an Ala (A), Leu (L), or He (I) residue; or (2) a mutant in which a 
non-methionine residue, such as an aliphatic residue, such as an Ala (A), Leu (L) or lie (I) 
residue, of the wild-type sequence is replaced with a methionine residue. 
[0097] " Selenomethionine mutant " refers to (1) a mutant which includes at least one 
selenomethionine (SeMet) residue, typically by substitution of a Met residue of the wild- 
type sequence with a SeMet residue, or by addition of one or more SeMet residues at one 
or both termini, or (2) a methionine mutant in which at least one Met residue is substituted 
with a SeMet residue. In some embodiments, each Met residue is substituted with a 
SeMet residue. 

[0098] " Cysteine mutant " refers to a mutant in which at least one cysteine residue of 
the wild-type sequence is replaced with another residue, such as with a Ser (S) residue. 
[0100] " Serine mutant " refers to a mutant in which at least one serine residue of the 
wild-type sequence is replaced with another residue, such as with a cysteine residue. 
[0101] " Selenocvsteine mutant " refers to (1) a mutant which includes at least one 
selenocysteine (SeCys) residue, typically by substitution of a Cys residue of the wild-type 
sequence with a SeCys residue, or by addition of one or more SeCys residues at one or 
both termini, or (2) a cysteine mutant in which at least one Cys residue is substituted with 
a SeCys residue. In some embodiments, SeCys mutants are those in which each Cys 
residue is substituted with a SeCys residue. 

[0102] "Homolog" refers to a polypeptide having at least 30%, preferably at least 40%, 

preferably at least 50%, preferably at least 60%, preferably at least 70%, more preferably 

t 

at least 80%, and most preferably at least 90% amino acid sequence identity or having a 
BLAST E-value of 1 x 10" 6 over at least 100 amino acids (Altschul et al., Nucleic Acids 
Res., 25:3389-402, 1997) with PAK6KD or any functional domain of PAK6KD. 
[0103] "Crystal" refers to a composition comprising a polypeptide in crystalline form. 
The term "crystal" includes native crystals, heavy-atom derivative crystals and co-crystals, 
as defined herein. 

[0104] " Native Crystal " refers to a crystal wherein the polypeptide is substantially pure. 

As used herein, native crystals do not include crystals of polypeptides comprising amino 
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acids that are modified with heavy atoms, such as crystals of selenomethionine mutants, 
selenocysteine mutants, etc. 

[0105] " Heavy-atom Derivative Crystal " refers to a crystal wherein the polypeptide is 
in association with one or more heavy-metal atoms. As used herein, heavy-atom derivative 
crystals include native crystals into which a heavy metal atom is soaked, as well as 
crystals of selenomethionine mutants and selenocysteine mutants. 
[0106] " Co-Crystal " refers to a crystalline form of a co-complex. 
[0107] " Apo-crystal" refers to a crystal wherein the polypeptide is substantially pure 
and substantially free of compounds that might form a co-complex with the polypeptide 
such as cofactors, ligands, substrates, substrate analogues, inhibitors, allosteric affecters, 
etc. 

[0108] " Diffraction Quality Crystal " refers to a crystal that is well-ordered and of a 
sufficient size, Le. 9 at least 10/im, at least 50/xm, or at least lOOjim in its smallest 
dimension such that it produces measurable diffraction to at least 3A resolution, preferably 
to at least 2 A resolution, and most preferably to at least 1.5 A resolution or lower. 
Diffraction quality crystals include native crystals, heavy-atom derivative crystals, and co- 
crystals. 

[0109] " Unit Cell " refers to the smallest and simplest volume element (i.e., 
parallelepiped-shaped block) of a crystal that is completely representative of the unit or 
pattern of the crystal, such that the entire crystal may be generated by translation of the 
unit cell. The dimensions of the unit cell are defined by six numbers: dimensions a, b and 
c and the angles are defined as a, P, and y (Blundell et cd., Protein Crystallography, 83-84, 
Academic Press. 1976). A crystal is an efficiently packed array of many unit cells. 
[0110] " Triclinic Unit Cell " refers to a unit cell in which a*b*c and o*P*y. 
[0111] " Monoclinic Unit Cell " refers to a unit cell in which a*b9*e; a=^=90°; and 
P>90°. 

[0112] " Hexagonal Unit Cell " refers to a unit cell in which a=b*c; (X=P= 90°; and 
Y=120°. 

[0113] " Orthorhombic Unit Cell " refers to a unit cell in which a*b*c; and <x=0 = 
Y=90°. 

[0114] " Tetragonal Unit Cell " refers to a unit cell in which a=b*c; and a =P =y =90°. 
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[0115] " Trigonal/Rhombohedral Unit Cell " refers to a unit cell in which a=b=c; and 
a=P =y *90°. 

[0116] " Trigonal/Hexagonal Unit Cell " refers to a unit cell in which a=b*c; ot=f}=90°; 
and y=120°. 

[0117] " Cubic Unit Cell " refers to a unit cell in which a=b=c; and a =p =y =90°. 
[0118] " Crystal Lattice " refers to the array of points defined by the vertices of packed 
unit cells. 

[0119] " Space Group " refers to the set of symmetry operations of a unit cell. In a space 
group designation (e.g., C2) the capital letter indicates the lattice type and the other 
symbols represent symmetry operations that may be carried out on the unit cell without 
changing its appearance. 

[0120] " Asymmetric Unit " refers to the largest aggregate of molecules in the unit cell 
that possesses no symmetry elements that are part of the space group symmetry, but that 
may be juxtaposed on other identical entities by symmetry operations. 
[0121] " Crystallographicallv-Related Dimer (or oligomer) " refers to a dimer (or 
oligomer, such as, for example, a trimer or a tetramer) of two (or more) molecules wherein 
the symmetry axes or planes that relate the two (or more) molecules comprising the dimer 
(or oligomer) coincide with the symmetry axes or planes of the crystal lattice. 
[0122] " Non-Crvstallographically-Related Dimer (or oligomer) " refers to a dimer (or 
oligomer, such as, for example, a trimer or a tetramer) of two (or more) molecules wherein 
the symmetry axes or planes that relate the two (or more) molecules comprising the dimer 
(or oligomer) do not coincide with the symmetry axes or planes of the crystal lattice. 
[0123] " Isomorphous Replacement " refers to the method of using heavy-atom 
derivative crystals to obtain the phase information necessary to elucidate the three- 
dimensional structure of a crystallized polypeptide (Blundell et a/., Protein 
Crystallography, Academic Press, esp. pp. 151-64, 1976; Methods in Enzymology 
276:361-557, Academic Press, 1997). The phrase "heavy-atom derivatization" is 
synonymous with "isomorphous replacement." 

[0124] " Multi -Wavelength Anomalous Dispersion or MAD " refers to a 
crystal lographic technique in which X-ray diffraction data are collected at several different 
wavelengths from a single heavy-atom derivative crystal, wherein the heavy atom has 
absorption edges near the energy of incoming X-ray radiation. The resonance between X- 
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rays and electron orbitals leads to differences in X-ray scattering from absorption of the X- 
rays (known as anomalous scattering) and permits the locations of the heavy atoms to be 
identified, which in turn provides phase information for a crystal of a polypeptide. A 
detailed discussion of MAD analysis may be found in Hendrickson, Trans. Am. 
Crystallogr. Assoc., 21:11, 1985; Hendrickson etal, EMBO J. 9:1665, 1990; and 
Hendrickson, Science, 254:51-58, 1991. 

[0125] " Single Wavelength Anomalous Dispersion or SAD " refers to a 

crystal lographic technique in which X-ray diffraction data are collected at a single 

wavelength from a single native or heavy-atom derivative crystal, and phase information is 

extracted using anomalous scattering information from atoms such as sulfur or chlorine in 

the native crystal or from the heavy atoms in the heavy-atom derivative crystal. The 

wavelength of X-rays used to collect data for this phasing technique needs to be close to 

the absorption edge of the anomalous scatterer. A detailed discussion of SAD analysis 

may be found in Brodersen, et al., Acta Cryst, D56:431-41, 2000. 

[0126] " Single Isomorphous Replacement With Anomalous Scattering or SIRAS" 

refers to a crystallographic technique that combines isomorphous replacement and 

anomalous scattering techniques to provide phase information for a crystal of a 

polypeptide. X-ray diffraction data are collected at a single wavelength, usually from a 

single heavy-atom derivative crystal. Phase information obtained only from the location 

of the heavy atoms in a single heavy-atom derivative crystal leads to an ambiguity in the 

phase angle, which is resolved using anomalous scattering from the heavy atoms. Phase 

information is therefore extracted from both the location of the heavy atoms and from 

anomalous scattering of the heavy atoms. A detailed discussion of SIRAS analysis may be 

found in North, Acta Cryst. 18:212-16, 1965; Matthews, Acta Cryst., 20:82-86, 1966. 

[0127] " Molecular Replacement " refers to the method using the structure coordinates 

of a known polypeptide to calculate initial phases for a new crystal of a polypeptide whose 

structure coordinates are unknown. This is done by orienting and positioning a 

polypeptide whose structure coordinates are known within the unit cell of the new crystal. 

Phases are then calculated from the oriented and positioned polypeptide and combined 

with observed amplitudes to provide an approximate Fourier synthesis of the structure of 

the polypeptides comprising the new crystal. The model is then refined to provide a 

refined set of structure coordinates for the new crystal (Lattman, Methods in Enzymology, 

115:55-77, 1985; Rossmann, ft The Molecular Replacement Method," Int. Sci. Rev. Ser. 
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No. 13, Gordon & Breach, New York, 1972; Methods in Enzymology, Vols. 276, 277 
(Academic Press, San Diego 1997)). Molecular replacement may be used, for example, to 
determine the structure coordinates of a crystalline mutant or homolog of PAK6KD using 
the structure coordinates of PAK6KD. 

[0128] " Structure coordinates " refers to mathematical coordinates derived from 
mathematical equations related to the patterns obtained on diffraction of a monochromatic 
beam of X-rays by the atoms (scattering centers) of a PAK6KD in crystal form. The 
diffraction data are used to calculate an electron density map of the repeating unit of the 
crystal. The electron density maps are Used to establish the positions of the individual 
atoms within the unit cell of the crystal. 

[0129] " Having substantially the same three-dimensional structure " refers to a 
polypeptide that is characterized by a set of molecular structure coordinates that have a 
root mean square deviation (r.m.s.d.) of up to about or equal to 1.5 A, preferably 1.25 A, 
preferably 1 A, and preferably 0.5 A, and preferably 0.25 A, when superimposed onto the 
molecular structure coordinates of Fig. 4 when at least 50% to 100% of the C-alpha atoms 
of the coordinates are included in the superposition. The program MOE may be used to 
compare two structures (Chemical Computing Group, Inc., Montreal, Canada). Where 
structure coordinates are not available for a particular amino acid residue(s), those 
coordinates are not included in the calculation. 

[0130] "OzC" or " a-carbon " or "CA" as used herein, "a-C" or "a-carbon M refer to the 
alpha carbon of an amino acid residue. 

[0131] " a-helix " refers to the conformation of a polypeptide chain in the form of a 
spiral chain of amino acids stabilized by hydrogen bonds. 

[0132] The term "B-sheet " refers to the conformation of a polypeptide chain stretched 
into an extended zig-zag conformation. Portions of polypeptide chains that run "parallel" 
all run in the same direction. Where polypeptide chains are "antiparallel," neighboring 
chains run in opposite directions from each other. The term "run" refers to the N to 
COOH direction of the polypeptide chain. 
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DETAILED DESCRIPTION OF THE INVENTION 



Crystalline PAK6 

[0133] Both native and heavy-atom derivative crystals, such as those obtained from 
selenium methionine derivative PAK6KD may be used to obtain the molecular structure 
coordinates of the present invention. 

[0134] The PAK6 comprising the crystals of the invention may be isolated from any 
bacterial, plant, or animal source in which PAK6 is present. Within the scope of the 
present invention are proteins that are homologous to PAK6 that are derived from any 
biological kingdom. The PAK6 may be derived from a mammalian source, such as, for 
example, Homo sapiens. The crystals may comprise wild-type PAK6 or mutants of wild- 
type PAK6. Mutants of wild-type PAK6 are obtained by replacing at least one amino acid 
residue in the sequence of the wild-type PAK6 with a different amino acid residue, or by 
adding or deleting one or more amino acid residues within the wild-type sequence and/or 
at the N- and/or C-terminus of the wild-type PAK6. The mutants may, but not necessarily, 
crystallize under crystallization conditions that are substantially similar to those used to 
crystallize the wild-type PAK6. 

[0135] The types of mutants contemplated by this invention include, but are not limited 
to, conservative mutants, non-conservative mutants, deletion mutants, truncated mutants, 
extended mutants, methionine mutants, selenomethionine mutants, cysteine mutants and 
selenocysteine mutants. A mutant may have, but need not display, PAK6 activity. A 
mutant may, for example, display biological activity that is substantially similar to that of 
the wild-type polypeptide. Methionine, selenomethione, cysteine, and selenocysteine 
mutants are particularly useful for producing heavy-atom derivative crystals, as described 
in detail, below. 

[0136] It will be recognized by one of skill in the art that the types of mutants 
contemplated herein are not mutually exclusive; that is, for example, a polypeptide having 
a conservative mutation in one amino acid may in addition have a truncation of residues at 
the N-terminus, and several Ala, Leu, or Ile-»Met mutations. 

[0137] Sequence alignments of polypeptides in a protein family or of homologous 

polypeptide domains may be used to identify potential amino acid residues in the 

polypeptide sequence that are candidates for mutation. Identifying mutations that do not 

significantly interfere with the three-dimensional structure of PAK6 and/or that do not 
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deleteriously affect, and that may even enhance, the activity of PAK6 will depend, in part, 
on the region where the mutation occurs. In highly variable regions of the molecule, non- 
conservative substitutions as well as conservative substitutions may be tolerated without 
significantly disrupting the folding, the three-dimensional structure and/or the biological 
activity of the molecule. In highly conserved regions, or regions containing significant 
secondary structure, conservative amino acid substitutions may be tolerated. 
[0138] Conservative amino acid substitutions are well known in the art, and include 
substitutions made on the basis of a similarity in polarity, charge, solubility, 
hydrophobicity and/or the hydrophilicity of the amino acid residues involved. Typical 
conservative substitutions are those in which the amino acid is substituted with a different 
amino acid that is a member of the same class or category, as those classes are defined 
herein. Thus, typical conservative substitutions include aromatic to aromatic, apolar to 
apolar, aliphatic to aliphatic, acidic to acidic, basic to basic, polar to polar, etc. Other 
conservative amino acid substitutions are well known in the art. It will be recognized by 
those of skill in the art that generally, a total of 20% or fewer, typically 10% or fewer, 
most usually 5% or fewer, of the amino acids in the wild-type polypeptide sequence may 
be conservatively substituted with other amino acids without deleteriously affecting the 
biological activity, the folding, and/or the three-dimensional structure of the molecule, 
provided that such substitutions do not involve residues that are critical for activity, for 
example, critical binding pocket residues. 

[0139] In some embodiments, it may be desirable to make mutations in the active site 
of a protein, e.g., to reduce or completely eliminate protein activity. For example, it may 
be desirable to mutate important residues in the active site of a protease in order to reduce 
or eliminate protease activity and to avoid autolysis in solution or in a crystal. Thus, for 
example, in aspartyl proteases, the active site Asp residue may be mutated to an Ala or 
Asn residue to reduce protease activity. The active site Ser residue in serine proteases 
may be mutated to an Ala, Cys or Thr residue to reduce or eliminate protease activity. 
Similarly, the activity of a cysteine protease may be reduced or eliminated by mutating the 
active site Cys residue to an Ala, Ser or Thr residue. Other mutations that will reduce or 
completely eliminate the activity of a particular protein will be apparent to those of skill in 
the art. 

[0140] The amino acid residue Cys (C) is unusual in that it can form disulfide bridges 

with other Cys (C) residues or other sulfhydryls, such as, for example, sulfhydryl- 
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containing amino acids ("cysteine-like amino acids"). The ability of Cys (C) residues and 
other cysteine-like amino acids to exist in a polypeptide in either the reduced free -SH or 
oxidized disulfide-bridged form affects whether Cys (C) residues contribute net 
hydrophobic or hydrophilic character to a polypeptide. While Cys (C) exhibits a 
hydrophobicity of 0.29 according to the consensus scale of Eisenberg (Eisenberg et al. y J. 
Mol. Biol. 179:125-42, 1984), it is to be understood that for purposes of the present 
invention Cys (C) is categorized as a polar hydrophilic amino acid, notwithstanding the 
general classifications defined above. For example, Cys residues that are known to 
participate in disulfide bridges are not substituted or are conservatively substituted with 
other cysteine-like amino acids so that the residue can participate in a disulfide bridge. 
Typical cysteine-like residues include, for example, Pen, hCys, etc. Substitutions for Cys 
residues that interfere with crystallization are discussed infra. 

[0141] The structural coordinates of a binding pocket and/or of the protein may be 
used, for example, to engineer new molecules. These new molecules may be expressed in 
cells, for example, in plant cells using, for example, gene transformation, to improve 
nutrient yields in plant crops or to use plants to produce new molecules. 
[0142] While in most instances the amino acids of PAK6KD will be substituted with 
genetically-encoded amino acids, in certain circumstances mutants may include non- 
genetically encoded amino acids. For example, non-encoded derivatives of certain 
encoded amino acids, such as SeMet and/or SeCys, may be incorporated into the 
polypeptide chain using biological expression systems (such SeMet and SeCys mutants are 
described in more detail, infra). 

[0143] Alternatively, in instances where the mutant will be prepared in whole or in part 
by chemical synthesis, virtually any non-encoded amino acids may be used, ranging from 
D-isomers of the genetically encoded amino acids to non-encoded naturally-occurring 
natural and synthetic amino acids. 

[0144] Conservative amino acid substitutions for many of the commonly known non- 

genetically encoded amino acids are well known in the art. Conservative substitutions for 

other non-encoded amino acids may be determined based on their physical properties as 

compared to the properties of the genetically encoded amino acids. 

[0145] Those of ordinary skill in the art will recognize that substitutions, additions, 

and/or deletions that do not substantially alter the three dimensional structure of PAK6KD 

and that, for example, do not substantially alter the three dimensional structure of the 
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PAK6KD binding pocket or pockets discussed in the present application, are within the 
scope of the present invention. Such substitutions, additions, and/or deletions may be 
useful, for example, to provide convenient cloning sites in cDNA encoding PAK6, to aid 
in its purification, or to aid in obtaining crystallization. 

[0146] These substitutions, deletions and/or additions include, but are not limited to, 
His tags, intein-containing self-cleaving tags, maltose binding protein fusions, glutathione 
S-transferase protein fusions, antibody fusions, green fluorescent protein fusions, signal 
peptide fusions, biotin accepting peptide fusions, tags that contain protease cleavage sites, 
and the like. Mutations may also be introduced into a polypeptide sequence where there 
are residues, e.g., cysteine residues that interfere with crystallization. These cysteine 
residues may be substituted with an appropriate amino acid that does not readily form 
covalent bonds with other amino acid residues under crystallization conditions; e.g., by 
substituting the cysteine with Ala, Ser or Gly. Any cysteine located in a non-helical or 
non-stranded segment, based on secondary structure assignments, are good candidates for 
replacement. 

[0147] Mutants within the scope of the invention may or may not have PAK6 activity. 
Amino acid substitutions, additions and/or deletions that might alter or inhibit PAK6 
activity are within the scope of the present invention. These mutants may be used in their 
crystalline form, or the molecular structure coordinates obtained therefrom, for example, 
to determine PAK6 structure and/or to provide phase information to aid the determination 
of the three-dimensional X-ray structures of other related or non-related crystalline 
polypeptides. 

[0148] The heavy-atom derivative crystals from which the molecular structure 
coordinates of the invention are obtained generally comprise a crystalline PAK6KD 
polypeptide in association with one or more heavy atoms, such as, for example, Xe, Kr, 
Br, I, or a heavy metal atom. The polypeptide may correspond to a wild- type or a mutant 
PAK6KD, which may optionally be in co-complex with one or more molecules, as 
previously described. There are various types of heavy-atom derivatives of polypeptides: 
heavy-atom derivatives resulting from exposure of the protein to a heavy atom in solution, 
wherein crystals are grown in medium comprising the heavy atom, or in crystalline form, 
wherein die heavy atom diffuses into the crystal, heavy-atom derivatives wherein the 
polypeptide comprises heavy-atom containing amino acids, e.g., selenomethionine and/or 
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selenocysteine, and heavy atom derivatives where the heavy atom is forced in under 
pressure, such as, for example, in a xenon chamber. 

[0149] In practice, heavy-atom derivatives of the first type may be formed by soaking a 
native crystal in a solution comprising heavy metal atom salts, or organometallic 
compounds, e.g., lead chloride, gold thiomalate, ethylmercurithiosalicylic acid-sodium salt 
(thimerosal), uranyl acetate, platinum tetrachloride, osmium tetraoxide, zinc sulfate, and 
cobalt hexamine, which can diffuse through the crystal and bind to the crystalline 
polypeptide. 

[0150] Heavy-atom derivatives of this type can also be formed by adding to a 

crystallization solution comprising the polypeptide to be crystallized, an amount of a 

heavy metal atom salt, which may associate with the protein and be incorporated into the 

crystal. The location(s) of the bound heavy metal atom(s) may be determined by X-ray 

diffraction analysis of the crystal. This information, in turn, is used to generate the phase 

information needed to construct the three-dimensional structure of the protein. 

[0151] Heavy-atom derivative crystals may also be prepared from polypeptides that 

include one or more SeMet and/or SeCys residues (SeMet and/or SeCys mutants). Such 

selenocysteine or selenomethionine mutants may be made from wild-type or mutant 

PAK6KD by expression of PAK6KD-encoding cDNAs in auxotrophic E coli strains 

(Hendrickson et al, EMBO J. 9(5): 1665-72, 1990). In this method, the wild-type or 

mutant PAK6KD cDNA may be expressed in a host organism on a growth medium 

depleted of either natural cysteine or methionine (or both) but enriched in selenocysteine 

or selenomethionine (or both). Alternatively, selenocysteine or selenomethionine mutants 

may be made using nonauxotrophic E. coli strains, e.g., by inhibiting methionine 

biosynthesis in these strains with high concentrations of He, Lys, Phe, Leu, Val or Thr and 

then providing selenomethionine in the medium (Doubli6, Methods in Enzymology, 

276:523-30, 1997). Furthermore, selenocysteine may be selectively incorporated into 

polypeptides by exploiting the prokaryotic and eukaryotic mechanisms for selenocysteine 

incorporation into certain classes of proteins in vivo, as described in U.S. Patent No. 

5,700,660 to Leonard et al. (filed June 7, 1995). One of skill in the art will recognize that 

selenocysteine may, for example, not incorporated in place of cysteine residues that form 

disulfide bridges, as these may be important for maintaining the three-dimensional 

structure of the protein and may, for example, not be eliminated. One of skill in the art 

will further recognize that, in order to obtain accurate phase information, approximately 
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one selenium atom should be incorporated for every 140 amino acid residues of the 
polypeptide chain. The number of selenium atoms incorporated into the polypeptide chain 
may be conveniently controlled by designing a Met or Cys mutant having an appropriate 
number of Met and/or Cys residues, as described more fully below. 

[0152] In some instances, the polypeptide to be crystallized may not contain cysteine or 
methionine residues. Therefore, if selenomethionine and/or selenocysteine mutants are to 
be used to obtain heavy-atom derivative crystals, methionine and/or cysteine residues may 
be introduced into the polypeptide chain. Likewise, Cys residues must be introduced into 
the polypeptide chain if the use of a cysteine-binding heavy metal, such as mercury, is 
contemplated for production of a heavy-atom derivative crystal. 

[0153] Such mutations are, for example, introduced into the polypeptide sequence at 
sites that will not disturb the overall protein fold. For example, a residue that is conserved 
among many members of the protein family or that is thought to be involved in 
maintaining its activity or structural integrity, as determined by, e.g., sequence alignments, 
should not be mutated to a Met or Cys. In addition, conservative mutations, such as Ser to 
Cys, or Leu or lie to Met, are, for example, introduced. One additional consideration is 
that, in order for a heavy-atom derivative crystal to provide phase information for structure 
determination, the location of the heavy atom(s) in the crystal unit cell must be 
determinable and provide phase information. Therefore, a mutation is, for example, not 
introduced into a portion of the protein that is likely to be mobile, e.g., at, or within 1-5 
residues of, the N- and C-termini, or within loops. 

[0154] Conversely, if there are too many methionine and/or cysteine residues in a 
polypeptide sequence, over-incorporation of the selenium-containing side chains can lead 
to the inability of the polypeptide to fold and/or crystallize, and may potentially lead to 
complications in solving the crystal structure. In this case, methionine and/or cysteine 
mutants are prepared by substituting one or more of these Met and/or Cys residues with 
another residue. The considerations for these substitutions are the same as those discussed 
above for mutations that introduce methionine and/or cysteine residues into the 
polypeptide. Specifically, the Met and/or Cys residues are, for example, conservatively 
substituted with Leu/Ile and Ser, respectively. 

[0155] As DNA encoding cysteine and methionine mutants may be used in the methods 

described above for obtaining SeCys and SeMet heavy-atom derivative crystals, the Cys or 

Met mutant may have, for example, one Cys or Met residue for every 140 amino acids. 
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Production of Polypeptides 

[0156] The native and mutated PAK6KD or PAK6 polypeptides described herein may 
be chemically synthesized in whole or part using techniques that are well known in the art 
(see, e.g., Creighton, Proteins: Structures and Molecular Principles, W.H. Freeman & Co., 
NY, 1983). 

[0157] Gene expression systems may be used for the synthesis of native and mutated 
polypeptides. Expression vectors containing the native or mutated polypeptide coding 
sequence and appropriate transcriptional/translational control signals, that are known to 
those skilled in the art may be constructed. These methods include in vitro recombinant 
DNA techniques, synthetic techniques and in vivo recombination/genetic recombination. 
See, for example, the techniques described in Sambrook et al, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, NY, 2001, and Ausubel et al, 
Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 
Interscience, NY, 1989. 

[0158] Host-expression vector systems may be used to express PAK6KD or PAK6. 
These include, but are not limited to, microorganisms such as bacteria transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 
containing the coding sequence; yeast transformed with recombinant yeast expression 
vectors containing the coding sequence; insect cell systems infected with recombinant 
virus expression vectors (e.g., baculovirus) containing the coding sequence; plant cell 
systems infected with recombinant virus expression vectors (e.g., cauliflower mosaic 
virus, CaMV ; tobacco mosaic virus, TMV) or transformed with recombinant plasmid 
expression vectors (e.g., Ti plasmid) containing the coding sequence; or animal cell 
systems. The protein may also be expressed in human gene therapy systems, including, 
for example, expressing the protein to augment the amount of the protein in an individual, 
or to express an engineered therapeutic protein. The expression elements of these systems 
vary in their strength and specificities. 

[0159] Specifically designed vectors allow the shuttling of DNA between hosts such as 

bacteria-yeast or bacteria-animal cells. An appropriately constructed expression vector 

may contain: an origin of replication for autonomous replication in host cells, one or more 

selectable markers, a limited number of useful restriction enzyme sites, a potential for high 

copy number, and active promoters. A promoter is defined as a DNA sequence that 
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directs RNA polymerase to bind to DNA and initiate RNA synthesis. A strong promoter 
is one that causes mRNAs to be initiated at high frequency. 
[0160] The expression vector may also comprise various elements that affect 
transcription and translation, including, for example, constitutive and inducible promoters. 
These elements are often host and/or vector dependent. For example, when cloning in 
bacterial systems, inducible promoters such as the T7 promoter, pL of bacteriophage X, 
plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like may be used; when cloning in 
insect cell systems, promoters such as the baculovirus polyhedrin promoter may be used; 
when cloning in plant cell systems, promoters derived from the genome of plant cells (e.g., 
heat shock promoters; the promoter for the small subunit of RUBISCO; the promoter for 
the chlorophyll a/b binding protein) or from plant viruses (e.g., the 35S RNA promoter of 
CaMV; the coat protein promoter of TMV) may be used; when cloning in mammalian cell 
systems, mammalian promoters (e.g., metallothionein promoter) or mammalian viral 
promoters, (e.g., adenovirus late promoter; vaccinia virus 7.5K promoter; SV40 promoter; 
bovine papilloma virus promoter; and Epstein-Barr virus promoter) may be used. 
[0161] Various methods may be used to introduce the vector into host cells, for 
example, transformation, transfection, infection, protoplast fusion, and electroporation. 
The expression vector-containing cells are clonally propagated and individually analyzed 
to determine whether they produce the appropriate polypeptides. Various selection 
methods, including, for example, antibiotic resistance, may be used to identify host cells 
that have been transformed. Identification of polypeptide expressing host cell clones may 
be done by several means, including but not limited to immunological reactivity with anti- 
PAK6KD or PAK6 antibodies, and the presence of host cell-associated activity. 
[0162] Expression of cDNA may also be performed using in vitro produced synthetic 
mRNA. Synthetic mRNA may be efficiently translated in various cell-free systems, 
including but not limited to wheat germ extracts and reticulocyte extracts, as well as 
efficiently translated in cell-based systems, including, but not limited, to microinjection 
into frog oocytes. 

[0163] To determine the cDNA sequence(s) that yields optimal levels of activity and/or 

protein, modified cDNA molecules are constructed. A non-limiting example of a 

modified cDNA is where the codon usage in the cDNA has been optimized for the host 

cell in which the cDNA will be expressed. Host cells are transformed with the cDNA 

molecules and the levels of PAK6KD or PAK6 RNA and/or protein are measured. 
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[0164] Levels of PAK6 or PAK6KD protein in host cells are quantitated by a variety of 
methods such as immunoaffinity and/or ligand affinity techniques, PAK6 or PAK6KD- 
specific affinity beads or specific antibodies are used to isolate 35 S-methionine labeled or 
unlabeled protein. Labeled or unlabeled protein is analyzed by SDS-PAGE. Unlabeled 
protein is detected by Western blotting, ELISA or RIA employing specific antibodies. 
[0165] Following expression of PAK6 or PAK6KD in a recombinant host cell, 
polypeptides may be recovered to provide the protein in active form. Several purification 
procedures are available and suitable for use. Recombinant PAK6 or PAK6KD may be 
purified from cell lysates or from conditioned culture media, by various combinations of, 
or individual application of, fractionation, or chromatography steps that are known in the 
art. 

[0166] In addition, recombinant PAK6 or PAK6KD may be separated from other 
cellular proteins by use of an immuno-affinity column made with monoclonal or 
polyclonal antibodies specific for full length nascent protein or polypeptide fragments 
thereof. Other affinity based purification techniques known in the art may also be used. 
[0167] Alternatively, the polypeptides may be recovered from a host cell in an 
unfolded, inactive form, e.g., from inclusion bodies of bacteria. Proteins recovered in this 
form may be solubilized using a denaturant, e.g., guanidinium hydrochloride, and then 
refolded into an active form using methods known to those skilled in the art, such as 
dialysis. 

Crystallization Of Polypeptides And Characterization Of Crystal 
[0168] Various methods known in the art may be used to produce the native and heavy- 
atom derivative crystals of the present invention. Methods include, but are not limited to, 
batch, liquid bridge, dialysis, and vapor diffusion (see, e.g., McPherson, Crystallization of 
Biological Macromolecules, Cold Spring Harbor Press, New York, 1998; McPherson, Eur. 
J. Biochem. 189:1-23, 1990; Weber, Adv. Protein Chem. 41:1-36, 1991; Methods in 
Enzymology 276:13-22, 100-110; 131-143, Academic Press, San Diego, 1997). 
[0169] Generally, native crystals are grown by dissolving substantially pure 
polypeptide in an aqueous buffer containing a precipitant at a concentration just below that 
necessary to precipitate the protein. Examples of precipitants include, but are not limited 
to, polyethylene glycol, ammonium sulfate, 2-methyl-2,4-pentanediol, sodium citrate, 

sodium chloride, glycerol, isopropanol, lithium sulfate, sodium acetate, sodium formate, 
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potassium sodium tartrate, ethanol, hexanediol, ethylene glycol, dioxane, t-butanol and 
combinations thereof. Water is removed by controlled evaporation to produce 
precipitating conditions, which are maintained until crystal growth ceases. 
[0170] In one embodiment, native crystals are grown by vapor diffusion in hanging 
drops or sitting drops (McPherson, Preparation and Analysis of Protein Crystals, John 
Wiley, New York, 1982; McPherson, Eur. J. Biochem. 189:1-23, 1990). Generally, up to 
about 25 fiL, or up to about 5|al, 3|ml, or 2^1, of substantially pure polypeptide solution is 
mixed with a volume of reservoir solution. The ratio may vary according to biophysical 
conditions, for example, the ratio of protein volume: reservoir volume in the drop may be 
1:1, giving a precipitant concentration about half that required for crystallization. Those 
of ordinary skill in the art recognize that the drop and reservoir volumes may be varied 
within certain biophysical conditions and still allow crystallization. In the sitting drop 
method, the polypeptide/precipitant solution is allowed to equilibrate in a closed container 
with a larger aqueous reservoir having a precipitant concentration optimal for producing 
crystals. In the hanging drop method, the polypeptide solution mixed with reservoir 
solution is suspended as a droplet underneath, for example, a coverslip, which is sealed 
onto the top of the reservoir. For both methods, the sealed container is allowed to stand, 
usually, for example, for up to 2-6 weeks, until crystals grow. The drop may be checked 
periodically to determine if a crystal has formed. One way of viewing the drop is using, 
for example, a microscope. One method of checking the drop, for high throughput 
purposes, includes methods that may be found in, for example, U.S. Utility Patent 
Application 10/042,929, filed October 18, 2001, entitled "Apparatus and Method for 
Identification of Crystals By In-situ X-Ray Diffraction." Such methods include, for 
example, using an automated apparatus comprising a crystal growing incubator, an X-ray 
source adjacent to the crystal growing incubator, where the X-ray source is configured to 
irradiate the crystalline material grown in the crystal growing incubator, and an X-ray 
detector configured to detect the presence of the diffracted X-rays from crystalline 
material grown in the incubator. In some examples, a charge coupled video camera is 
included in the detector system. 

[0171] Those having skill in the art will recognize that the above-described 

crystallization conditions may be varied. Such variations may be used alone or in 

combination, and may include various volumes of protein solution and reservoir solution 

known to those of ordinary skill in the art. Other buffer solutions may be used such as 
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Tris, imidazole, or MOPS buffer, so long as the desired pH range is maintained, and the 
chemical composition of the buffer is compatible with crystal formation. Compounds or 
other ligands may be added to the crystallization solution in order to obtain co-crystals. 
[0172] Heavy-atom derivative crystals may be obtained by soaking native crystals in 
mother liquor containing salts of heavy metal atoms and can also be obtained from SeMet 
and/or SeCys mutants, as described above for native crystals. 
[0173] Mutant proteins may crystallize under slightly different crystallization 
conditions than wild-type protein, or under very different crystallization conditions, 
depending on the nature of the mutation, and its location in the protein. For example, a 
non-conservative mutation may result in alteration of the hydrophilicity of the mutant, 
which may in turn make the mutant protein either more soluble or less soluble than the 
wild-type protein. Typically, if a protein becomes more hydrophilic as a result of a 
mutation, it will be more soluble than the wild-type protein in an aqueous solution and a 
higher precipitant concentration will be needed to cause it to crystallize. Conversely, if a 
protein becomes less hydrophilic as a result of a mutation, it will be less soluble in an 
aqueous solution and a lower precipitant concentration will be needed to cause it to 
crystallize. If the mutation happens to be in a region of the protein involved in crystal 
lattice contacts, crystallization conditions may be affected in more unpredictable ways. 

Characterization of Crystals 

[0174] The dimensions of a unit cell of a crystal are defined by six numbers, the 
lengths of three unique edges, a, b, and c, and three unique angles a, P, and y. The type of 
unit cell that comprises a crystal is dependent on the values of these variables, as discussed 
above. 

[0175] When a crystal is exposed to an X-ray beam, the electrons of the molecules in 

the crystal diffract the beam such that there is a sphere of diffracted X-rays around the 

crystal. The angle at which diffracted beams emerge from the crystal may be computed by 

treating diffraction as if it were reflection from sets of equivalent, parallel planes of atoms 

in a crystal (Bragg' s Law). The most obvious sets of planes in a crystal lattice are those 

that are parallel to the faces of the unit cell. These and other sets of planes may be drawn 

through the lattice points. Each set of planes is identified by three indices, hkl. The h 

index gives the number of parts into which the a edge of the unit cell is cut, the k index 

gives the number of parts into which the b edge of the unit cell is cut, and the 1 index gives 
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the number of parts into which the c edge of the unit cell is cut by the set of hkl planes. 
Thus, for example, the 235 planes cut the a edge of each unit cell into halves, the b edge of 
each unit cell into thirds, and the c edge of each unit cell into fifths. Planes that are 
parallel to the be face of the unit cell are the 100 planes; planes that are parallel to the ac 
face of the unit cell are the 010 planes; and planes that are parallel to the ab face of the 
unit cell are the 001 planes. 

[0176] When a detector is placed in the path of the diffracted X-rays, in effect cutting 
into the sphere of diffraction, a series of spots, or reflections, may be recorded of a still 
crystal (not rotated) to produce a "still" diffraction pattern. Each reflection is the result of 
X-rays reflecting off one set of parallel planes, and is characterized by an intensity, which 
is related to the distribution of molecules in the unit cell, and hkl indices, which 
correspond to the parallel planes from which the beam producing that spot was reflected. 
If the crystal is rotated about an axis perpendicular to the X-ray beam, a large number of 
reflections are recorded on the detector, resulting in a diffraction pattern. 
[0177] The unit cell dimensions and space group of a crystal may be determined from 
its diffraction pattern. First, the spacing of reflections is inversely proportional to the 
lengths of the edges of the unit cell. Therefore, if a diffraction pattern is recorded when 
the X-ray beam is perpendicular to a face of the unit cell, two of the unit cell dimensions 
may be deduced from the spacing of the reflections in the x and y directions of the 
detector, the crystal-to-detector distance, and the wavelength of the X-rays. Those of skill 
in the art will appreciate that, in order to obtain all three unit cell dimensions, the crystal 
must be rotated such that the X-ray beam is perpendicular to another face of the unit cell. 
Second, the angles of a unit cell may be determined by the angles between lines of spots 
on the diffraction pattern. Third, the absence of certain reflections and the repetitive 
nature of the diffraction pattern, which may be evident by visual inspection, indicate the 
internal symmetry, or space group, of the crystal. Therefore, a crystal may be 
characterized by its unit cell and space group, as well as by its diffraction pattern. 
[0178] Once the dimensions of the unit cell are determined, the likely number of 
polypeptides in the asymmetric unit may be deduced from the size of the polypeptide, the 
density of the average protein, and the typical solvent content of a protein crystal, which is 
usually in the range of 30-70% of the unit cell volume (Matthews, J. Mol. Biol. 33(2):491- 
97, 1968). 
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Collection of Data and Determination of Structure Solutions 

[0179] The diffraction pattern is related to the three-dimensional shape of the molecule 
by a Fourier transform. The process of determining the solution is in essence a re- 
focusing of the diffracted X-rays to produce a three-dimensional image of the molecule in 
the crystal. Since re-focusing of X-rays cannot be done with a lens at this time, it is done 
via mathematical operations. 

[0180] The sphere of diffraction has symmetry that depends on the internal symmetry 
of the crystal, which means that certain orientations of the crystal will produce the same 
set of reflections. Thus, a crystal with high symmetry has a more repetitive diffraction 
pattern, and there are fewer unique reflections that need to be recorded in order to have a 
complete representation of the diffraction. The goal of data collection, a dataset, is a set of 
consistently measured, indexed intensities for as many reflections as possible. A complete 
dataset is collected if at least 80%, preferably at least 90%, most preferably at least 95% of 
unique reflections are recorded. In one embodiment, a complete dataset is collected using 
one crystal. In another embodiment, a complete dataset is collected using more than one 
crystal of the same type. 

[0181] Sources of X-rays include, but are not limited to, a rotating anode X-ray 
generator such as a Rigaku RU-200, a micro source or mini-source, a sealed-beam source, 
or a beam line at a synchrotron light source, such as the Advanced Photon Source at 
Argonne National Laboratory. Suitable detectors for recording diffraction patterns 
include, but are not limited to, X-ray sensitive film, multiwire area detectors, image plates 
coated with phosphorus, and CCD cameras. Typically, the detector and the X-ray beam 
remain stationary, so that, in order to record diffraction from different parts of the crystal's 
sphere of diffraction, the crystal itself is moved via an automated system of moveable 
circles called a goniostat. 

[0182] One of the biggest problems in data collection, particularly from 

macromolecular crystals having a high solvent content, is the rapid degradation of the 

crystal in the X-ray beam. In order to slow the degradation, data is often collected from a 

crystal at liquid nitrogen temperatures. In order for a crystal to survive the initial exposure 

to liquid nitrogen, the formation of ice within the crystal may be prevented by the use of a 

cryoprotectant. Suitable cryoprotectants include, but are not limited to, low molecular 

weight polyethylene glycols, ethylene glycol, sucrose, glycerol, xylitol, and combinations 

thereof. Crystals may be soaked in a solution comprising the one or more cryoprotectants 
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prior to exposure to liquid nitrogen, or the one or more cryoprotectants may be added to 
the crystallization solution. Data collection at liquid nitrogen temperatures may allow the 
collection of an entire dataset from one crystal. 

[0183] Once a dataset is collected, the information is used to determine the three- 
dimensional structure of the molecule in the crystal. This phase information may be 
acquired by methods described below in order to perform a Fourier transform on the 
diffraction pattern to obtain the three-dimensional structure of the molecule in the crystal. 
It is the determination of phase information that in effect refocuses X-rays to produce the 
image of the molecule. 

[0184] One method of obtaining phase information is by isomorphous replacement, in 
which heavy-atom derivative crystals are used. In this method, the positions of heavy 
atoms bound to the molecules in the heavy-atom derivative crystal are determined, and 
this information is then used to obtain the phase information necessary to elucidate the 
three-dimensional structure of a native crystal (Blundell etaL, Protein Crystallography, 
Academic Press, 1976). 

[0185] Another method of obtaining phase information is by molecular replacement, 
which is a method of calculating initial phases for a new crystal of a polypeptide whose 
structure coordinates are unknown by orienting and positioning a polypeptide whose 
structure coordinates are known within the unit cell of the new crystal so as to best 
account for the observed diffraction pattern of the new crystal. Phases are then calculated 
from the oriented and positioned polypeptide and combined with observed amplitudes to 
provide an approximate Fourier synthesis of the structure of the molecules comprising the 
new crystal (Lattman, Methods in Enzymology 115:55-77, 1985; Rossmann, "The 
Molecular Replacement Method," Int. Sci. Rev. Ser. No. 13, Gordon & Breach, New 
York, 1972). 

[0186] A third method of phase determination is multi-wavelength anomalous 
diffraction or MAD. In this method, X-ray diffraction data are collected at several 
different wavelengths from a single crystal containing at least one heavy atom with 
absorption edges near the energy of incoming X-ray radiation. The resonance between X- 
rays and electron orbitals leads to differences in X-ray scattering that permits the locations 
of the heavy atoms to be identified, which in turn provides phase information for a crystal 
of a polypeptide. A detailed discussion of MAD analysis may be found in Hendrickson, 
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Trans. Am. Crystallogr. Assoc., 21:11, 1985; Hendrickson et a/., EMBO J. 9: 1665, 1990; 
and Hendrickson, Science, 254:51-58, 1991). 

[0187] A fourth method of determining phase information is single wavelength 
anomalous dispersion or SAD. In this technique, X-ray diffraction data are collected at a 
single wavelength from a single native or heavy-atom derivative crystal, and phase 
information is extracted using anomalous scattering information from atoms such as sulfur 
or chlorine in the native crystal or from the heavy atoms in the heavy-atom derivative 
crystal. The wavelength of X-rays used to collect data for this phasing technique need not 
be close to the absorption edge of the anomalous scatterer. A detailed discussion of SAD 
analysis may be found in Brodersen, et al., Acta Cryst., D56:431-41, 2000. 
[0188] A fifth method of determining phase information is single isomorphous 
replacement with anomalous scattering or SIRAS. SIRAS combines isomorphous 
replacement and anomalous scattering techniques to provide phase information for a 
crystal of a polypeptide. X-ray diffraction data are collected at a single wavelength, 
usually from both a native and a single heavy-atom derivative crystal. Phase information 
obtained only from the location of the heavy atoms in a single heavy-atom derivative 
crystal leads to an ambiguity in the phase angle, which is resolved using anomalous 
scattering from the heavy atoms. Phase information is extracted from both the location of 
the heavy atoms and from anomalous scattering of the heavy atoms. A detailed discussion 
of SIRAS analysis may be found in North, Acta Cryst. 18:212-16, 1965; Matthews, Acta 
Cryst. 20:82-86, 1966; Methods in Enzymology 276:530-37, 1997. 
[0189] Once phase information is obtained, it is combined with the diffraction data to 
produce an electron density map, an image of the electron clouds surrounding the atoms 
that constitute the molecules in the unit cell. The higher the resolution of the data, the 
more distinguishable the features of the electron density map, because atoms that are 
closer together are resolvable. A model of the macromolecule is then built into the 
electron density map with the aid of a computer, using as a guide all available information, 
such as the polypeptide sequence and the established rules of molecular structure and 
stereochemistry. Interpreting the electron density map is a process of finding the 
chemically reasonable conformation that fits the map precisely. 

[0190] After a model is generated, a structure is refined. Refinement is the process of 

minimizing the function <|>, which is the difference between observed and calculated 

intensity values (measured by an R-factor), and which is a function of the position, 
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temperature factor, and occupancy of each non-hydrogen atom in the model. This usually 
involves alternate cycles of real space refinement, Le. t calculation of electron density maps 
and model building, and reciprocal space refinement, i.e., computational attempts to 
improve the agreement between the original intensity data and intensity data generated 
from each successive model. Refinement ends when the function (J) converges on a 
minimum wherein the model fits the electron density map and is stereochemically and 
conformationally reasonable. During the last stages of refinement, ordered solvent 
molecules are added to the structure. 

Structures of PAK6KD 

[0191] The present invention provides, for the first time, the high-resolution three- 
dimensional structures and molecular structure coordinates of crystalline PAK6KD as 
determined by X-ray crystallography. 

[0192] Contemplated within the scope of the present invention are any set of structure 
coordinates obtained for crystals of PAK6KD, whether native crystals, heavy-atom 
derivative crystals or co-crystals, that have a root mean square deviation ("r.m.s.d.") of up 
to about or equal to 1.5 A, preferably 1.25 A, preferably 1 A, preferably 1 .75 A, and 
preferably 0.5 A when superimposed, using backbone atoms (N, C-a, C and O), or using 
C-a atoms, on the structure coordinates listed in Fig. 4 are considered to be within the 
scope of the present invention when at least 50% to 100% of the backbone atoms of 
PAK6KD are included in the superposition. The amino acid numbers in Figure 4 reflect 
the amino acid position in the expressed protein used to obtain the crystals of the present 
invention. Those of ordinary skill in the art may align the sequence with other sequences 
of PAK6KD or PAK6 to, if desired, correlate the amino acid residue number with the 
Figure 4 sequence or the wild type sequence numbers. 

Structure Coordinates 

[0193] The molecular structure coordinates may be used in molecular modeling and 
design, as described more fully below. The present invention encompasses the structure 
coordinates and other information, e.g., amino acid sequence, connectivity tables, vector- 
based representations, temperature factors, etc., used to generate the three-dimensional 
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structure of the polypeptide for use in the software programs described below and other 
software programs. 

[0194] The invention includes methods of producing computer readable databases 
comprising the three-dimensional molecular structure coordinates of certain molecules, 
including, for example, the PAK6KD structure coordinates, the structure coordinates of 
binding pockets or active sites of PAK6KD, or structure coordinates of compounds 
capable of binding to PAK6KD. The databases of the present invention may comprise any 
number of sets of molecular structure coordinates for any number of molecules, including, 
for examples, structure coordinates of one molecule. In other embodiments, the databases 
of the present invention may comprise structure coordinates of a compound or compounds 
that have been identified by virtual screening to bind to a PAK6 binding pocket, or other 
representations of such compounds such as, for example, a graphic representation or a 
name. By "database" is meant a collection of retrievable data. The invention 
encompasses machine readable media embedded with or containing information regarding 
the three-dimensional structure of a crystalline polypeptide and/or model, such as, for 
example, its molecular structure coordinates, described herein, or with subunits, domains, 
and/or, portions thereof such as, for example, portions comprising active sites, accessory 
binding sites, and/or binding pockets in either liganded or unliganded forms. 
Alternatively, the information may be that of identifiers which represent specific structures 
found in a protein. As used herein, "machine readable medium" refers to any medium that 
may be read and accessed directly by a computer or scanner. Such media may take many 
forms, including but not limited to, non-volatile, volatile and transmission media. Non- 
volatile media, i.e., media that can retain information in the absence of power, includes a 
ROM. Volatile media, i.e., media that cannot retain information in the absence of power, 
includes a main memory. Transmission media includes coaxial cables, copper wire and 
fiber optics, including the wires that comprise the bus. Transmission media can also take 
the form of carrier waves; i.e., electromagnetic waves that may be modulated, as in 
frequency, amplitude or phase, to transmit information signals. Additionally, transmission 
media can take the form of acoustic or light waves, such as those generated during radio 
wave and infrared data communications. 

[0195] Such media also include, but are not limited to: magnetic storage media, such 

as floppy discs, flexible discs, hard disc storage medium and magnetic tape; optical 

storage media such as optical discs or CD-ROM; electrical storage media such as RAM or 
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ROM, PROM (i.e., programmable read only memory), EPROM (i.e., erasable 
programmable read only memory), including FLASH-EPROM, any other memory chip or 
cartridge, carrier waves, or any other medium from which a processor can retrieve 
information, and hybrids of these categories such as magnetic/optical storage media. Such 
media further include paper on which is recorded a representation of the molecular 
structure coordinates, e.g.* Cartesian coordinates, that may be read by a scanning device 
and converted into a format readily accessed by a computer or by any of the software 
programs described herein by, for example, optical character recognition (OCR) software. 
Such media also include physical media with patterns of holes, such as, for example, 
punch cards, and paper tape. 

[0196] A variety of data storage structures are available for creating a computer 
readable medium having recorded thereon the molecular structure coordinates of the 
invention or portions thereof and/or X-ray diffraction data. The choice of the data storage 
structure will generally be based on the means chosen to access the stored information. In 
addition, a variety of data processor programs and formats may be. used to store the 
sequence and X-ray data information on a computer readable medium. Such formats 
include, but are not limited to, macromolecular Crystallographic Information File 
("mmCIF") and Protein Data Bank ("PDB") format (Research Collaborator for Structural 
Bioinformatics; www.rcsb.org; Cambridge Crystallographic Data Centre format 
(www.ccdc.can.ac.uk/support/csd_doc/volume3/z323.html); Structure-data ("SD") file 
format (MDL Information Systems, Inc.; Dalby, et al. f J. Chem. Inf. Comp. Sci., 32:244- 
55, 1992; and line-notation, e.g., as used in SMILES (Weininger, J. Chem. Inf. Comp. Sci. 
28:31-36, 1988). Methods of converting between various formats read by different 
computer software will be readily apparent to those of skill in the art, e.g., BABEL (v. 
1.06, Walters & Stahl, ©1992, 1993, 1994; 

www.brunel.ac.uk/departments/chem/babel.htm). All format representations of the 
polypeptide coordinates described herein, or portions thereof, are contemplated by the 
present invention. By providing computer readable medium having stored thereon the 
atomic coordinates of the invention, one of skill in the art can routinely access the atomic 
coordinates of the invention, or portions thereof, and related information for use in 
modeling and design programs, described in detail below. 

[0197] A computer may be used to display the structure coordinates or the three- 
dimensional representation of the protein or peptide structures, or portions thereof, such 
Express Mail No. EV3 1 5 1 35696US 49 



Docket No. PT145/01.PRO 

as, for example, portions comprising active sites, accessory binding sites, and/or binding 
pockets, in either liganded or unliganded form, of the present invention. The term 
"computer" includes, but is not limited to, mainframe computers, personal computers, 
portable laptop computers, and personal data assistants ("PDAs") which can store data and 
independently run one or more applications, i.e., programs. The computer may include, 
for example, a machine readable storage medium of the present invention, a working 
memory for storing instructions for processing the machine-readable data encoded in the 
machine readable storage medium, a central processing unit operably coupled to the 
working memory and to the machine readable storage medium for processing the machine 
readable information, and a display operably coupled to the central processing unit for 
displaying the structure coordinates or the three-dimensional representation. The 
information contained in the machine-readable medium may be in the form of, for 
example, X-ray diffraction data, structure coordinates, electron density maps, or ribbon 
structures. The information may also include such data for co-complexes between a 
compound and a protein or peptide of the present invention. 

[0198] The computers of the present invention may also include, for example, a central 

processing unit, a working memory which may be, for example, random-access memory 

(RAM) or "core memory," mass storage memory (for example, one or more disk drives or 

CD-ROM drives), one or more cathode-ray tube ("CRT') display terminals or one or more 

LCD displays, one or more keyboards, one or more input lines, and one or more output 

lines, all of which are interconnected by a conventional bi-directional system bus. 

Machine-readable data of the present invention may be inputted and/or outputted through 

a modem or modems connected by a telephone line or a dedicated data line (either of 

which may include, for example, wireless modes of communication). The input hardware 

may also (or instead) comprise CD-ROM drives or disk drives. Other examples of input 

devices are a keyboard, a mouse, a trackball, a finger pad, or cursor direction keys. Output 

hardware may also be implemented by conventional devices. For example, output 

hardware may include a CRT, or any other display terminal, a printer, or a disk drive. The 

CPU coordinates the use of the various input and output devices, coordinates data accesses 

from mass storage and accesses to and from working memory, and determines the order of 

data processing steps. The computer may use various software programs to process the 

data of the present invention. Examples of many of these types of software are discussed 

throughout the present application. 
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[0199] Those of skill in the art will recognize that a set of structure coordinates is a 
relative set of points that define a shape in three dimensions. Therefore, two different sets 
of coordinates could define the identical or a similar shape. Also, minor changes in the 
individual coordinates may have very little effect on the peptide's shape. Minor changes 
in the overall structure may have very little to no effect, for example, on the binding 
pocket, and would not be expected to significantly alter the nature of compounds that 
might associate with the binding pocket. 

[0200] Although Cartesian coordinates are important and convenient representations of 
the three-dimensional structure of a polypeptide, other representations of the structure are 
also useful. Therefore, the three-dimensional structure of a polypeptide, as discussed 
herein, includes not only the Cartesian coordinate representation, but also all alternative 
representations of the three-dimensional distribution of atoms. For example, atomic 
coordinates may be represented as a Z-matrix, wherein a first atom of the protein is 
chosen, a second atom is placed at a defined distance from the first atom, and a third atom 
is placed at a defined distance from the second atom so that it makes a defined angle with 
the first atom. Each subsequent atom is placed at a defined distance from a previously 
placed atom with a specified angle with respect to the third atom, and at a specified torsion 
angle with respect to a fourth atom. Atomic coordinates may also be represented as a 
Patterson function, wherein all interatomic vectors are drawn and are then placed with 
their tails at the origin. This representation is particularly useful for locating heavy atoms 
in a unit cell. In addition, atomic coordinates may be represented as a series of vectors 
having magnitude and direction and drawn from a chosen origin to each atom in the 
polypeptide structure. Furthermore, the positions of atoms in a three-dimensional 
structure may be represented as fractions of the unit cell (fractional coordinates), or in 
spherical polar coordinates. 

[0201] Additional information, such as thermal parameters, which measure the motion 
of each atom in the structure, chain identifiers, which identify the particular chain of a 
multi-chain protein in which an atom is located, and connectivity information, which 
indicates to which atoms a particular atom is bonded, is also useful for representing a 
three-dimensional molecular structure. 

[0202] The structural information of a compound that binds a PAK6KD of the 
invention may be similarly stored and transmitted as described above for structural 
information of PAK6KD. 
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Uses of the Molecular Structure Coordinates 

[0203] Structure information, typically in the form of molecular structure coordinates, 
may be used in a variety of computational or computer-based methods to, for example, 
design, screen for, and/or identify compounds that bind the crystallized polypeptide or a 
portion or fragment thereof, or to intelligently design mutants that have altered biological 
properties. 

[0204] When designing or identifying compounds that may associate with a given 
protein, binding pockets are often analyzed. The term "binding pocket," refers to a region 
of a protein that, because of its shape, likely associates with a chemical entity or 
compound. A binding pocket may be the same as an active site. A binding pocket of a 
protein is usually involved in associating with the protein's natural ligands or substrates, 
and is often the basis for the protein's activity. A binding pocket may refer to an active 
site. Many drugs act by associating with a binding pocket of a protein. A binding pocket 
may comprise amino acid residues that line the cleft of the pocket. Those of ordinary skill 
in the art will recognize that the numbering system used for other isoforms of PAK6KD 
may be different, but that the corresponding amino acids may be determined with a 
homology software program known to those of ordinary skill in the art. A binding pocket 
homolog comprises amino acids having structure coordinates that have a root mean square 
deviation from structure coordinates, as indicated in Fig. 4, of the binding pocket amino 
acids of up to about 1.5 A, preferably up to about 1.25 A, preferably up to about lA, 
preferably up to about 0.75 A, preferably up to about 0.5 A, and preferably up to about 
0.25A. 

[0205] Where a binding pocket or regulatory site is said to comprise amino acids 
having particular structure coordinates, the amino acids comprise the same amino acid 
residues, or may comprise amino acids having similar properties, as shown in, for 
example, Table 1, and have either the same relative three-dimensional structure 
coordinates as Fig. 4, or the group of amino acid residues named as part of the binding 
pocket have an rmsd of within 1.5A, preferably within 1.25 A, preferably within lA, 
preferably within 0.75A, preferably within 0.5 A, and preferably within 0.25A of the 
structure coordinates of Fig. 4. Preferably, when comparing the structure coordinates of 
the backbone atoms of the amino acid residues, the rmsd is within 1.5 A, preferably within 
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1.25 A, preferably within lA, preferably within 0.75 A, preferably within 0.5 A, and more 
preferably within 0.25A. 

[0206] Software applications are available to compare structures, or portions thereof, to 
determine if they are sufficiently similar to the structures of the invention such as DALI 
(Holm and Sander, /. Mol Biol. 233:123-38, 1993; (See European Bioinformatics Institute 
site at www.ebi.ac.uk/) ; MOE (Chemical Computing Group, Inc. Montreal, Quebec, 
Canada; and DEJAVU (Uppsala Software Factory; Kleywegt, G.S. & Jones, T.A., 
"Detecting Folding Motifs and Similarities in Protein Structure," Methods in Enzymology, 
277:525-45, 1997). 

[0207] The crystals and structure coordinates obtained therefrom may be used for 
rational drug design to identify and/or design compounds that bind PAK6 as an approach 
towards developing new therapeutic agents. For example, a high resolution X-ray 
structure of, for example, a crystallized protein saturated with solvent, will often show the 
locations of ordered solvent molecules around the protein, and in particular at or near 
putative binding pockets of the protein. This information can then be used to design 
molecules that bind these sites, the compounds synthesized and tested for binding in 
biological assays (Travis, Science, 262:1374, 1993). 

[0208] The structure may also be computationally screened with a plurality of 

molecules to determine their ability to bind to the PAK6KD at various sites. Such 

compounds may be used as targets or leads in medicinal chemistry efforts to identify, for 

example, inhibitors of potential therapeutic importance (Travis, Science, 262:131 r 4, 1993). 

The three dimensional structures of such compounds may be superimposed on a three 

dimensional representation of PAK6KD or an active site or binding pocket thereof to 

assess whether the compound fits spatially into the representation and hence the protein. 

Structural information produced by such methods and concerning a compound that fits (or 

a fitting portion of such a compound) may be stored in a machine readable medium. 

Alternatively, one or more identifiers of a compound that fits, or a fitting portion thereof, 

may be stored in a machine readable medium. Examples of identifiers include chemical 

name or abbreviation, chemical or molecular formula, chemical structure, and/or other 

identifying information. As an non-limiting example, if the three dimensional structure of 

phenol is found to fit the active site of PAK6KD, the structural information of phenol, or 

the portion that fits, may be stored for further use. Alternatively, an identifier of phenol, 

or of the portion that fits, such as the -OH group, may be stored for further use. Other 
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identifying information for phenol may also be used to represent it. All storage of 
information concerning a compound that fits may optionally be in combination with one or 
more pieces of information concerning PAK6KD. 

[0209] In an analogous manner, the structure of PAK6KD or an active site or binding 
pocket thereof may be used to computationally screen small molecule databases for 
chemical entities or compounds that can bind in whole, or in part, to PAK6. In this 
screening, the quality of fit of such entities or compounds to the binding pocket may be 
judged either by shape complementarity or by estimated interaction energy (Meng, et a/., 
J. Comp. Chem. 13:505-24, 1992). 

[0210] In still another embodiment, compounds may be developed that are analogues 
of natural substrates, reaction intermediates or reaction products of PAK6. The reaction 
intermediates of PAK6 may be deduced from the substrates, or reaction products in 
co-complex with PAK6KD. The binding of substrates, reaction intermediates, and 
reaction products may change the conformation of the binding pocket, which provides 
additional information regarding binding patterns of potential ligands, activators, 
inhibitors, and the like. Such information is also useful to design improved analogues of 
known PAK6 inhibitors or to design novel classes of inhibitors based on the substrates, 
reaction intermediates, and reaction products of PAK6KD and PAK6KD-inhibitor 
co-complexes. This provides a novel route for designing PAK6KD inhibitors with both 
high specificity and stability. 

[0211] Another method of screening or designing compounds that associate with a 

binding pocket includes, for example, computationally designing a negative image of the 

binding pocket. This negative image may be used to identify a set of pharmacophores. A 

pharmacophore may be a description of functional groups and how they relate to each 

other in three-dimensional space. This set of pharmacophores may be used to design 

compounds and screen chemical databases for compounds that match with the 

pharmacophore(s). Compounds identified by this method may then be further evaluated 

computationally or experimentally for binding activity. Various computer programs may 

be used to create the negative image of the binding pocket, for example; GRID (Goodford, 

J. Med. Chem. 28:849-57, 1985; GRID is available from Oxford University, Oxford, UK); 

MCSS (Miranker & Karplus, Proteins: Structure, Function and Genetics 11:29-34, 1991; 

MCSS is available from Accelrys, Inc., San Diego, CA); LUDI (Bohm, J. Comp. Aid. 

Molec. Design 6:61-78, 1992; LUDI is available from Accelrys, Inc., San Diego, CA); 
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DOCK (Kuntz et al.; /. Mol Biol 161:269-88, 1982; DOCK is available from University 
of California, San Francisco, CA); DOCKIT (Metaphorics, Mission Viejo, CA) and MOE. 
Other appropriate programs are desribed in, for example, Halperin, et al., Proteins 47(4): 
409-43 (2002). 

[0212] Thus, among the various embodiments of the present invention are methods of 
identifying, screening, and designing compounds that associate with a binding pocket of 
PAK6KD. 

[0213] The design of compounds that bind to and/or modulate PAK6, for example that 
inhibit or activate PAK6 according to this invention generally involves consideration of 
two factors. First, the compound must be capable of physically and structurally 
associating, either covalently or non-covalently with PAK6. For example, covalent 
interactions may be important for designing irreversible or suicide inhibitors of a protein. 
Non-covalent molecular interactions important in the association of PAK6 with the 
compound include hydrogen bonding, ionic interactions and van der Waals and 
hydrophobic interactions. Second, the compound must be able to assume a conformation 
and orientation in relation to the binding pocket, that allows it to associate with PAK6. 
Although certain portions of the compound will not directly participate in this association 
with PAK6, those portions may still influence the overall conformation of the molecule 
and may have a significant impact on potency. Conformational requirements include the 
overall three-dimensional structure and orientation of the chemical group or compound in 
relation to all or a portion of the binding pocket, or the spacing between functional groups 
of a compound comprising several chemical groups that directly interact with PAK6. 
[0214] Computer modeling techniques may be used to assess the potential modulating 
or binding effect of a chemical compound on PAK6KD. If computer modeling indicates a 
strong interaction, the molecule may then be synthesized and tested for its ability to bind 
to PAK6 and affect (by inhibiting or activating) its activity. 

[0215] Modulating or other binding compounds of PAK6 may be computationally 

evaluated and designed by means of a series of steps in which chemical groups or 

fragments are screened and selected for their ability to associate with the individual 

binding pockets or other areas of PAK6. Several methods are available to screen chemical 

groups or fragments for their ability to associate with PAK6. This process may begin by 

visual inspection of, for example, the active site on the computer screen based on the 

PAK6KD coordinates. Selected fragments or chemical groups may then be positioned in 
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a variety of orientations, or docked, within an individual binding pocket of PAK6KD 
(Blaney, J.M. and Dixon, J.S., Perspectives in Drug Discovery and Design, 1:301, 1993). 
Manual docking may be accomplished using software such as Insight II (Accelrys, San 
Diego, CA) MOE; CE (Shindyalov, IN, Bourne, PE, "Protein Structure Alignment by 
Incremental Combinatorial Extension (CE) of the Optimal Path," Protein Engineering, 
1 1 :739-47, 1998); and SYB YL (Molecular Modeling Software, Tripos Associates, Inc., St. 
Louis, MO, 1992), followed by energy minimization and molecular dynamics with 
standard molecular mechanics force fields, such as CHARMM (Brooks, et al, 7. Comp. 
Chem. 4:187-217, 1983). More automated docking may be accomplished by using 
programs such as DOCK (Kuntz et al, /. Mol Biol, 161:269-88, 1982; DOCK is 
available from University of California, San Francisco, CA); AUTODOCK (Goodsell & 
Olsen, Proteins: Structure, Function, and Genetics 8:195-202, 1990; AUTODOCK is 
available from Scripps Research Institute, La Jolla, CA); GOLD (Cambridge 
Crystallographic Data Centre (CCDC); Jones et al., /. Mol Biol 245:43-53, 1995); and 
FLEXX (Tripos, St. Louis, MO; Rarey, M., et al., J. Mol Biol 261:470-89, 1996); 
AMBER (Weiner, et al, J. Am. Chem. Soc. 106:765-84, 1984) and C 2 MMFF (Merck 
Molecular Force Field; Accelrys, San Diego, CA). Other appropriate programs are 
described in, for example, Halperin, et al. 

[0216] Specialized computer programs may also assist in the process of selecting 
fragments or chemical groups. These include DOCK; GOLD; LUDI; FLEXX (Tripos, St. 
Louis, MO; Rarey, M., et al., J. Mol Biol 261:470-89, 1996); and GLIDE (Eldridge, et 
al., /. Comput. Aided Mol Des. 11:425-45, 1997; SchrSdinger, Inc., New York). Other 
appropriate programs are described in, for example, Halperin, et al., (Portland, OR). 
[0217] Once suitable chemical groups or fragments have been selected, they may be 
assembled into a single compound or inhibitor. Assembly may proceed by visual 
inspection of the relationship of the fragments to each other in the three-dimensional 
image displayed on a computer screen in relation to the structure coordinates of 
PAK6KD. This would be followed by manual model building using software such as 
SYBYL, (Tripos, St. Louis, MO); Insight U (Accelrys, San Diego, CA); and MOE 
(Chemical Computing Group, Inc., Montreal, Canada). Other appropriate program are 
described in, for example, Halperin, et al. 

[0218] Useful programs to aid one of skill in the art in connecting the individual 

chemical groups or fragments include, for example: 
Express Mail No. EV3 1 5 1 35696US 56 



Docket No. PT145/01.PRO 

1 . CAVEAT (Bartlett et al , 'CAVEAT: A Program to Facilitate the 
Structure-Derived Design of Biologically Active Molecules'. In Molecular Recognition in 
Chemical and Biological Problems', Special Pub., Royal Chem. Soc. 78:182-96, 1989). 
CAVEAT is available from the University of California, Berkeley, CA. 

2. 3D Database systems such as ISIS or MACCS-3D (MDL Information 
Systems, San Leandro, Calif.). This area is reviewed in Martin, J. Med. Chem. 
35:2145-54, 1992). 

3. HOOK (Eisen et al, Proteins: Struct., Funct., Genet, 19: 199-221, 1994) 
(available from Accelrys, Inc., San Diego, CA). 

4. LUDI (Bohm, J. Comp. Aid. Molec. Design 6:61-78, 1992). LUDI is 
available from Accelrys, Inc., San Diego, CA. 

[0219] Instead of proceeding to build a PAK6 inhibitor in a step-wise fashion one 
fragment or chemical group at a time, as described above, PAK6 binding compounds may 
be designed as a whole or *de novo 1 using either an empty active site or optionally 
including some portion(s) of a known inhibitor(s). These methods include, for example: 

1. LUDI (Bohm, J. Comp. Aid. Molec. Design 6:61-78, 1992). LUDI is 
available from Accelrys, Inc., San Diego, CA. 

2. LEGEND (Nishibata & Itai, Tetrahedron, 47:8985, 1991). LEGEND is 
available from Accelrys, Inc., San Diego, CA. 

3. LeapFrog (available from Tripos, Inc., St. Louis, Mo.). 

4. SPROUT (Gillet et al, J. Comput. Aided Mol. Design 7: 127-53, 1993) 
(available from the University of Leeds, U.K.). 

5. GenStar (Murcko, M.A. and Rotstein, S.H. J. Comput. Aided Mol. Des. 
7:23-43, 1993). 

6. GroupBuild (Rotstein, S.H., and Murcko, M. A., J. Med. Chem. 36: 1700, 

1993). 

7. GrowMol (Rich, D.H. et al., Chimia, 51:45, 1997). 

8. Grow (UpJohn; Moon J, Howe W, Proteins, 1 1:314-28, 1991). 

9. SmoG (DeWitte, R.S., Abstr. Pap Am Chem. S. 214:6-Comp Part 1, Sept. 
7, 1997; DeWitte, R.S. & Shakhnovich, E.I., J. Am. Chem. Soc. 118:11733-44, 1996). 

10. LigBuilder (PDB (www.rcsb.org/pdb); Wang R, Ying G, Lai L, J. Mol. 
Model. 6: 498-516, 1998). 
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[0220] Other molecular modeling techniques may also be employed in accordance with 
this invention. See, e.g., Cohen et al, J. Med. Chem. 33:883-94, 1990. See also, Navia & 
Murcko, Current Opinions in Structural Biology 2:202-10, 1992; Balbes et al, Reviews in 
Computational Chemistry, 5:337-80, 1994, (Lipkowitz and Boyd, Eds.) (VCH, New 
York); Guida, Curr. Opin. Struct. Biol. 4:777-81, 1994. 

[0221] During design and selection of compounds by the above methods, the efficiency 
with which that compound may bind to PAK6KD may be tested and optimized by 
computational evaluation. For example, a compound that has been designed or selected to 
function as a PAK6 inhibitor may occupy a volume not overlapping the volume occupied 
by the active site residues when the native substrate is bound, however, those of ordinary 
skill in the art will recognize that there is some flexibility, allowing for rearrangement of 
the main chains and the side chains. In addition, one of ordinary skill may design 
compounds that could exploit protein rearrangement upon binding, such as, for example, 
resulting in an induced fit. An effective PAK6 inhibitor may demonstrate a relatively 
small difference in energy between its bound and free states (i.e., it must have a small 
deformation energy of binding and/or low conformational strain upon binding). Thus, the 
most efficient PAK6 inhibitors should, for example, be designed with a deformation 
energy of binding of not greater than 10 kcal/mol, for example, not greater than 7 
kcal/mol, for example, not greater than 5 kcal/mol and, for example, not greater than 2 
kcal/mol. PAK6 inhibitors may interact with the protein in more than one conformation 
that is similar in overall binding energy. In those cases, the deformation energy of binding 
is taken to be the difference between the energy of the free compound and the average 
energy of the conformations observed when the inhibitor binds to the enzyme. 
[0222] A compound selected or designed for binding to PAK6KD may be further 
computationally optimized so that in its bound state it would, for example, lack repulsive 
electrostatic interaction with the target protein. Non-complementary electrostatic 
interactions include repulsive charge-charge, dipole-dipole and charge-dipole interactions. 
Specifically, the sum of all electrostatic interactions between the inhibitor and the protein 
when the inhibitor is bound to it may make a neutral or favorable contribution to the 
enthalpy of binding. 

[0223] Specific computer software is available in the art to evaluate compound 

deformation energy and electrostatic interaction. Examples of programs designed for such 

uses include: Gaussian 94, revision C (Frisch, Gaussian, Inc., Pittsburgh, PA. ©1995); 
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AMBER, version 7 (Kollman, University of California at San Francisco, ©2002); 
QUANT A/CHARMM (Accelrys, Inc., San Diego, CA, ©1995); Insight II/Discover 
(Accelrys, Inc., San Diego, CA, ©1995); DelPhi (Accelrys, Inc., San Diego, CA, ©1995); 
and AMSOL (University of Minnesota) (Quantum Chemistry Program Exchange, Indiana 
University). These programs may be implemented, for instance, using a computer 
workstation, as are well known in the art, for example, a LINUX, SGI or Sun workstation. 
Other hardware systems and software packages will be known to those skilled in the art. 
[0224] Once a PAK6KD binding compound has been optimally selected or designed, 
as described above, substitutions may then be made in some of its atoms or chemical 
groups in order to improve or modify its binding properties. Generally, initial 
substitutions are conservative, i.e., the replacement group will have approximately the 
same size, shape, hydrophobicity and charge as the original group. One of skill in the art 
will understand that substitutions known in the art to alter conformation should be 
avoided. Such altered chemical compounds may then be analyzed for efficiency of 
binding to PAK6KD by the same computer methods described in detail above. Methods 
of structure-based drug design are described in, for example, Klebe, G., J. Mol. Med. 
78:269-81, 2000); Hoi. W.G.J., Angewandte Chemie (Int'l Edition in English) 25:767-852, 
1986; and Gane, P.J. and Dean, P.M., Current Opinion in Structural Biology, 10:401-04, 
2000. 

[0225] The present invention also provides means for the preparation of a compound 
the structure of which has been identified or designed, as described above, as binding 
PAK6KD or an active site or binding pocket thereof. Where the compound is already 
known or designed, the synthesis thereof may readily proceed by means known in the art. 
Alternatively, compounds that match the structure of one or more pharmacophores as 
described above may be prepared by means known in the art. In an alternative 
embodiment, the production of a compound may proceed by introduction of one or more 
desired chemical groups by attachment to an initial compound which binds PAK6KD or 
an active site or binding pocket thereof and which has, or has been modified to contain, 
one or more chemical moieties for attachment of one or more desired chemical groups. 
The initial compound may be viewed as a "scaffold" comprising at least one moiety 
capable of binding or associating with one or more residues of PAK6KD or an active site 
or binding pocket thereof. 



Express Mail No. EV315135696US 



59 



Docket No. PT145/01.PRO 

[0226] The initial compound may be a flexible or rigid "scaffold", optionally 
containing a linker for introduction of additional chemical moieties. Various scaffold 
compounds may be used, including, but not limited to, aliphatic carbon chains, 
pyrrolidinones, sulfonamidopyrrolidinones, cycloalkanonedienes including 
cyclopentanonedienes, cyclohexanonedienes, and cyclopheptanonedienes, carbazoles, 
imidazoles, benzimidiazoles, pyridine, isoxazoles, isoxazolines, benzoxazinones, 
benzamidines, pyridinones and derivatives thereof. Other scaffolds are described in, for 
example, Klebe, G., 7. Mol Med. 78: 269-281 (2000); Maignan, S. and Mikol, V., Curr. 
Top. Med. Chem. 1: 161-174 (2001); and U.S. Patent No. 5,756,466 to Bemis et al The 
scaffold compound used may, for example, be one that comprises at least one moiety 
capable of binding or associating with one or more residues of PAK6KD or an active site 
or binding pocket thereof. 

[0227] Chemical moieties on the scaffold compound that permit attachment of one or 
more desired functional chemical groups may undergo conventional reactions by coupling, 
substitution, and electrophilic or nucleophilic displacement. For example, the moieties 
may be those already present on the compound or readily introduced. Alternatively, an 
variant of the scaffold compound comprising the moieties is utilized initially. As a non- 
limiting example, the moiety may be a leaving group which can readily be removed from 
the scaffold compound. Various moieties may be used, including but not limited to 
pyrophosphates, acetates, hydroxy groups, alkoxy groups, tosylates, brosylates, halogens, 
and the like. In another embodiment of the invention, the scaffold compound is 
synthesized from readily available starting materials using conventional techniques. (See 
e.g., U.S. Patent 5,756,466 for general synthetic methods). Chemical groups are then 
introduced into the scaffold compound to increase the number of interactions with one or 
more residues of PAK6KD or an active site or binding pocket thereof. 
[0228] Because PAK6KD may crystallize in more than one crystal form, the structure 
coordinates of PAK6KD, or portions thereof, are particularly useful to solve the structure 
of those other crystal forms of PAK6KD. They may also be used to solve the structure of 
PAK6KD mutants, PAK6KD co-complexes, or of the crystalline form of any other protein 
with significant amino acid sequence homology to any functional domain of PAK6KD. 
[0229] Homologs or mutants of PAK6KD may, for example, have an amino acid 
sequence homology to the Homo sapiens amino acid sequence of Fig. 2 of greater than 

60%, more preferred proteins have a greater than 70% sequence homology, more preferred 
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proteins have a greater than 80% sequence homology, more preferred proteins have a 
greater than 90% sequence homology, and most preferred proteins have greater than 95% 
sequence homology. A protein domain, region, or binding pocket may have a level of 
amino acid sequence homology to the corresponding domain, region, or binding pocket 
amino acid sequence of Homo sapiens of Fig. 2 of greater than 60%, more preferred 
proteins have a greater than 70% sequence homology, more preferred proteins have a 
greater than 80% sequence homology, more preferred proteins have a greater than 90% 
sequence homology, and most preferred proteins have greater than 95% sequence 
homology. Percent homology may be determined using, for example, a PSI BLAST 
search, such as, but not limited to version 2.1.2 (Altschul, S.F., et al., Nuc. Acids Rec. 
25:3389-3402, 1997). 

[0230] One method that may be employed for this purpose is molecular replacement. In 
this method, the unknown crystal structure, whether it is another crystal form of PAK6KD, 
a PAK6KD mutant, or a PAK6KD co-complex, or the crystal of some other protein with 
significant amino acid sequence homology to any functional domain of PAK6KD, may be 
determined using phase information from the PAK6KD structure coordinates. This 
method may provide an accurate three-dimensional structure for the unknown protein in 
the new crystal more quickly and efficiently than attempting to determine such 
information ab initio. In addition, in accordance with this invention, PAK6KD mutants 
may be crystallized in co-complex with known PAK6KD inhibitors. The crystal structures 
of a series of such complexes may then be solved by molecular replacement and compared 
with that of wild-type PAK6KD. Potential sites for modification within the various 
binding pockets of the protein may thus be identified. A co-crystal may be obtained, for 
example, by soaking a crystalline form of a target protein in the presence of at least one 
ligand. Or, a co-crystal may be obtained, for example, by crystallizing a co-complex, by 
preparing a solution comprising a target protein and a ligand, and then following an 
appropriate crystallization method. The ligand may be present in the mother liquor or, if it 
is insoluble in the mother liquor, it may be dissolved, at the highest concentration possible, 
in DMSO, for example. 

[0231] This information provides an additional tool for determining the most efficient 
binding interactions, for example, increased hydrophobic interactions, between PAK6KD 
and a chemical group or compound. 
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[0232] If an unknown crystal form has the same space group as and similar cell 
dimensions to the known PAK6KD crystal form, then the phases derived from the known 
crystal form may be directly applied to the unknown crystal form, and in turn, an electron 
density map for the unknown crystal form may be calculated. Difference electron density 
maps can then be used to examine the differences between the unknown crystal form and 
the known crystal form. A difference electron density map is a subtraction of one electron 
density map, e.g., that derived from the known crystal form, from another electron density 
map, e.g., that derived from the unknown crystal form. Therefore, all similar features of 
the two electron density maps are eliminated in the subtraction and only the differences 
between the two structures remain. For example, if the unknown crystal form is of a 
PAK6KD co-complex, then a difference electron density map between this map and the 
map derived from the native, uncomplexed crystal will ideally show only the electron 
density of the ligand. Similarly, if amino acid side chains have different conformations in 
the two crystal forms, then those differences will be highlighted by peaks (positive 
electron density) and valleys (negative electron density) in the difference electron density 
map, making the differences between the two crystal forms easy to detect. However, if the 
space groups and/or cell dimensions of the two crystal forms are different, then this 
approach will not work and molecular replacement must be used in order to derive phases 
for the unknown crystal form. 

[0233] All of the complexes referred to above may be studied using well-known X-ray 

diffraction techniques and may be refined against data extending from about 500A to at 

least 3.0A or 1.5A, until the refinement has converged to limits accepted by those skilled 

in the art, such as, but not limited to, R=0.2, Rfree=0.25. This may be determined using 

computer software, such as X-PLOR, CNX, or refmac (part of the CCP4 suite; 

Collaborative Computational Project, Number 4, "The CCP4 Suite: Programs for Protein 

Crystallography," Acta Cryst. D50, 760-63, 1994). See, e.g., Blundell et al ., Protein 

Crystallography, Academic Press; Methods in Enzymology, Vols. 1 14 & 115, 1976; 

Wyckoff et al, eds., Academic Press, 1985; Methods in Enzymology, Vols. 276 and 277 

(Carter & Sweet, eds., Academic Press 1997); "Application of Maximum Likelihood 

Refinement" G. Murshudov, A.Vagin and E.Dodson, (1996) in the Refinement of Protein 

Structures, Proceedings of Daresbury Study Weekend; G.N. Murshudov, A.A.Vagin and 

EJ.Dodson, Acta Cryst. D53, 240-55, 1997; G.N.Murshudov, A.Lebedev, A.A.Vagin, 

K.S.Wilson and EJ.Dodson, Acta Cryst. Section D55, 247-55, 1999. See, e.g., Blundell et 
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al. t Protein Crystallography, Academic Press; Methods in Enzymology, Vols. 1 14 & 115, 
1976; Wyckoff et al, eds., Academic Press, Methods in Enzymology, Vols. 276 and 277, 
1985 (Carter & Sweet, eds., Academic Press 1997). This information may thus be used to 
optimize known classes of PAK6 inhibitors, and more importantly, to design and 
synthesize novel classes of PAK6 inhibitors. 

[0234] The structure coordinates of PAK6KD mutants will also facilitate the 
identification of related proteins or enzymes analogous to PAK6 in function, structure or 
both, thereby further leading to novel therapeutic modes for treating or preventing diseases 
or disorders in which PAK6 activity is implicated. 

[0235] Subsets of the molecular structure coordinates may be used in any of the above 
methods. Particularly useful subsets of the coordinates include, but are not limited to, 
coordinates of single domains, coordinates of residues lining an active site or binding 
pocket, coordinates of residues that participate in important protein-protein contacts at an 
interface, and alpha-carbon coordinates. For example, the coordinates of one domain of a 
protein that contains the active site may be used to design inhibitors that bind to that site, 
even though the protein is fully described by a larger set of atomic coordinates. Therefore, 
a set of atomic coordinates that define the entire polypeptide chain, although useful for 
many applications, do not necessarily need to be used for the methods described herein. 

EXAMPLES 
Example 1: Determination of PAK6 Structure 
[0236] The subsections below describe the production of a polypeptide comprising the 
Homo sapiens PAK6, and the preparation and characterization of diffraction quality 
crystals and heavy-atom derivative crystals. 

Example 1.1: Preparation of PAK6 Crystals 
[0237] Human liver cDNA is synthesized using a standard cDNA synthesis kit following 
the manufacturers' instructions. The template for the cDNA synthesis is mRNA isolated 
from JAR cells [ATCC HTB-144] using a standard RNA isolation kit. An open-reading 
frame for PAK6KD is amplified from the human liver cDNA by the polymerase chain 
reaction (PCR) using the following primers: 
[0238] Forward primer: ACACATGAGCAGTTCAAGG 
Reverse primer: CCTTTCGGTAGAGCTGGATC 
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The PCR product (873 base pairs expected) is electrophoresed on a 1% agarose gel in TBE 
buffer and the appropriate size band is excised from the gel and eluted using a standard gel 
extraction kit. 

[0239] The eluted DNA is ligated for five minutes at room temperature with 
topoisomerase into pSGX6, previously digested with BamHI and Hindm. The vector 
pSGX6 is a topoisomerase-activated modified version of pET26b (Novagen, Madison, 
Wisconsin) wherein the coding sequence for smt3 (Genbank entry U27233) from amino 
acids 1 to 121 has been inserted between the Ndel and BamHI sites (Bernier-Villamor, V., 
et al., Cell 108:345-356, 2002). In addition, the pSGX6 vector contains a gene coding for 
lambda phosphatase. 

[0240] The phosphatase co-expression plasmid was created by inserting the phosphatase 
gene from lambda bacteriophage into the above plasmid (Matsui T, et al., Biochem. 
Biophys. Res. Commun., 2001, 284:798-807). The phosphatase gene was amplified using 
PCR from template lambda bacteriophage DNA (HinDm digest, New England Biolabs) 
using the following oligonucleotide primers: 

Forward primer (PPfor): GCAGAGATCCGAATTCGAGCTCCGTCGACGGAT 
GGAGTGAAAGAGATGCGC 

Reverse primer (PPrev): GGTGGTGGTGCTCGAGTGCGGCCGCAA 
GCTTTCATCATGCGCCTTCTCCCTGTAC 

[0241] The PCR product (744 base pairs expected) was gel purified. The purified DNA 
and non-co-expression plasmid DNA were then digested with SacI and Xhol restriction 
enzymes. Both the digested plasmid and PCR product were then gel purified and ligated 
together for 8 hrs at 16° C with T4 DNA ligase and transformed into ToplO cells using 
standard procedures. The presence of the phosphatase gene in the co-expression plasmid 
was confirmed by sequencing. For standard molecular biology protocols followed here, 
see also, for example, the techniques described in Sambrook et al. y Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, NY, 2001, and Ausubel et al., 
Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 
Interscience, NY, 1989. 

[0242] The lambda phosphatase/PAK6KD co-expression plasmid contains both the 

PAK6KD and lambda phosphatase sequences under the control of the lac promoter, each 

with its own ribosome binding site. By cloning the phosphatase into the middle of the 

multiple cloning site, downstream of the target gene, convenient restriction sites are 
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available for subcloning the phosphatase into other plasmids. These sites include SacI, 
Sail and EcoRI between the kinase and phosphatase and HinDm, NotI and Xhol 
downstream of the phosphatase. 

[0243] The resulting sequence of the PAK6KD gene after being ligated into the vector, 
from the Shine-Dalgarno sequence through the stop site and the "original" Hindm, site is 
as follows: AAGGAGATATA CCATGGGCAGCA GCCATCATC ATCATCA 
TCACAGCAGCGGCCT GGTGCCGCGCGGCAGCCATA 

TGGCTAGCrSMT31TCCrORF1 . The PAK6KD expressed using this vector has an N- 
terminal methionine, then a 6 x His-tag followed by the smt3 fusion protein followed by 
the kinase domain of PAK6KD. 

[0244] Plasmids containing ligated inserts are transformed into chemically competent 
TOP10 cells. Colonies are then screened for inserts in the correct orientation and small 
DNA amounts are purified using a "miniprep" procedure from 2 ml cultures, using a 
standard kit, following the manufacturer's instructions. For standard molecular biology 
protocols followed here, see also, for example, the techniques described in Sambrook et 
al. y Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY, 
2001, and Ausubel et a/., Current Protocols in Molecular Biology, Greene Publishing 
Associates and Wiley Interscience, NY, 1989. 

[0245] The miniprep DNA is transformed into BL21(DE3)-Codon+RIL cells and plated 
onto petri dishes containing LB agar with 30 ^ig/ml of kanamycin and 34 |ig/ml of 
chloramphenicol. Isolated, single colonies are grown to mid-log phase and stored at - 
80°C in LB containing 15% glycerol. 

[0246] The PAK6KD fusion protein is over expressed in E. coli in the presence of 60 
mg/L selenomethionine as follows. Glycerol stocks are grown in LB (lOg/L tryptone, 
5g/L yeast extract, lOg/L NaCl) with 30 jxg/ml kanamycin and 34 |ig/ml chloramphenicol. 
The culture is grown to an OD600 of 0.6 to 1.0, then IPTG is added at a 0.4mM final 
concentration. The culture is allowed to ferment for 16hr at 20°C. The PAK6KD is 
purified as follows. Cells are collected by centrifugation, lysed in diluted cracking buffer 
(50 mM Tris HC1, pH 7.6, 0.1% Tween20) s and centrifuged to remove cell debris. The 
soluble fraction is purified over an IMAC column charged with nickel (Pharmacia, 
Uppsala, Sweden), and eluted under native conditions with a gradient of 20mM to SOOmM 
imidazole in 50mM Tris.pH7.8, lOmM methionine, 10% glycerol. Next, the PAK6KD 
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fusion protein is mixed with Ulpl protease at a concentration of 1: 10,000 in elution buffer 
and incubated overnight at 4°C (Bernier-Villamor, V., et al. f Cell, 108:345-56, 2002); 
Mossessova, E., and Lima, CD., Mol. Cell 5:865-76, 2000). The cleavage reaction takes 
place during dialysis against the IMAC starting buffer. The cleaved PAK6 fusion protein 
is passed over an IMAC column, charged with nickel, a second time. The cleaved PAK6 
is recovered from the flowthrough, whereas the Smt-fusion partner, the uncleaved protein, 
and the His-tagged Ulp protease remained bound to the column. The cleaved PAK6KD 
fusion protein is then further purified by gel filtration using a Superdex 200 preparative 
grade column equilibrated in GF5 buffer (lOmM HEPES, lOmM methionine, 500 mM 
NaCl, 5 mM DTT, and 10% glycerol). Fractions containing the purified PAK6KD kinase 
domain are pooled, concentrated to 8.5mg/ml, flash frozen and stored at -80°C. The 
protein obtained is 99% pure as judged by electrophoresis on SDS polyacrylamide gels. 
Mass spectroscopic analysis of the purified protein showed that it is predominantly singly 
phosphorylated. 

[0247] For crystals of Homo sapiens PAK6 from which the molecular structure 
coordinates of the invention are obtained, it has been found that a sitting drop containing 1 
^1 of PAK6 polypeptide, 14 mg/ml, in 10 mM Hepes pH 7.5, 150 mM NaCl, 5mM DTT 
lOmM methionine, ImM ADP, 2mM MgCl 2 , 10% glycerol, and reservoir solution: 15 
% (v/v) PEG 400, 200 mM sodium citrate dihydrate and 100 mM Tris-HCl, pH 8.5 in a 
sealed container containing lOO^iL reservoir solution, incubated at 21°C provides 
diffraction quality crystals. In one example, crystals first appeared within 24 hours, and 
matured within about 4 weeks. 

[0248]Other examples of methods of obtaining a crystal comprise the steps of: (a) mixing a 
volume of a solution comprising the PAK6 with a volume of a reservoir solution 
comprising a precipitant, such as, for example, polyethylene glycol; and (b) incubating the 
mixture obtained in step (a) over the reservoir solution in a closed container, under 
conditions suitable for crystallization until the crystal forms. At least 10% (v/v) of PEG 
400 is present in the reservoir solution. PEG 400 is, for example, present in a 
concentration up to about 20% (v/v). In another example, the concentration of PEG 400 is 
15% (v/v). The concentration of Tris-HCI is, for example, at least 25mM. The 
concentration of Tris-HCl is, for example, up to about 200mM. In another example, the 
concentration of Tris-HCl is lOOmM. The concentration of sodium citrate dihydrate is, for 
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example, at least lOOmM. The concentration of sodium citrate dihydrate is, for example, 
up to about 300mM. In another example, the concentration of sodium citrate dihydrate is 
200 mM. The reservoir solution has a pH of, for example, 8. The reservoir solution may, 
for example, have a pH up to about 9. In another example, the pH is about 8.5. The 
temperature is, for example, at least 4°C. The temperature may be, for example, up to 
about 30°C In another example, the temperature is 21°C. 

[0249] Those of ordinary skill in the art recognize that the drop and reservoir volumes 
may be varied within certain biophysical conditions and still allow crystallization. 

Example 1.2: Crystal Diffraction Data Collection 
[0250] The crystals are individually harvested from their trays and transferred to a 
cryoprotectant consisting of 80% reservoir solution plus 20% glycerol. After about 2 
minutes the crystal is collected and transferred into liquid nitrogen. The crystals are 
transferred in liquid nitrogen to the Advanced Photon Source (Argonne National 
Laboratory) where a native data set is collected. 

Example 1.3: Structure Determination 
[0251] X-ray diffraction data are indexed and integrated using the program MOSFLM 
(Collaborative Computational Project, Number 4, Acta. Cryst. D50, 760-63, 1994; 
www.ccp4.ac.uk/main.htmn and then merged using the program SCALA (Collaborative 
Computational Project, Number 4, Acta. Cryst D50, 760-63, 1994; 
www.ccp4.ac.uk/main.htmlV The subsequent conversion of intensity data to structure 
factor amplitudes is carried out using the program TRUNCATE (Collaborative 
Computational Project, Number 4, Acta. Cryst. D50, 760-763, 1994; 
www.ccp4.ac.uk/main.htmiy An initial model was obtained by molecular replacement 
using a structure of PAK4, for example, a structure having the coordinates of a structure 
described in Publication No. US-2003-0229453-A1, published on December 11, 2003. as a 
search model using the program EPMR (Kissinger, CR, et al., Acta Cryst., D55, 484-491, 
1999). The initial protein model was built into the resulting map using the program 
XTALVIEW/XFIT (McRee, D.E. J. Structural Biology, 125:156-65, 1993; available from 
CCMS (San Diego Super Computer Center) CCMS-request@sdsc.edu.). 
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[0252] This model is refined using the program REFMAC (Brunger et al., Acta Cryst. 
D53, 240-55, 2000; Collaborative Computational Project, Number 4, Acta. Cryst. D50, 
760-63, 1994; www.ccp4.ac.uk/main.htmn with interactive refitting carried out using the 
program XTALVIEW/XFIT (McRee, D.E. /. Structural Biology t 125:156-65, 1993; 
available from CCMS (San Diego Super Computer Center) CCMS-request@sdsc.edu\ 
The stereochemical quality of the atomic model is monitored using PROCHECK 
(Laskowski et al., /. AppL Cryst. 26, 283-91, 1993) and WHATCHECK (Vriend, G., J. 
Mol Graph 8:52-56, 1990; Hooft, R.W.W. et al., Nature 381:272, 1996) and the 
agreement of the model with the x-ray data is analyzed using SFCHECK (Collaborative 
Computational Project, Number 4, Acta. Cryst. D50, 760-63, 1994); 
www.ccp4.ac.uk/main.htmn . 



Table 1 Data Collection Statistics 



Space group 


P21 21 21 


Cell dimensions 


a = 59.24 A 




b = 66.72 A 




c = 97.21 A 




a =90° 




(3 = 90° 




Y = 90° 


Wavelength A. 


0.9794 A 


Overall Resolution limits 


21.6 A 




1.46 A 


Number of reflections collected 


241955 


Number of unique reflections 


64186 


Overall Redundancy of data 


3.6 


Overall Completeness of data 


98.4 % 


Completeness of data in last data shell 


90.9 % 


Overall R S ym 


0.124 


Rsym in last resolved shell 


1.793 


Overall I/sigma(I) 


6.4 


I/sigma(I) in last shell 


1.0 



Table 2 Model Refinement Statistics 



Model 








Total number of atoms 


2631 




Number of water molecules 


240 




Temperature factor for all atoms 


17.8 A z 




Matthews coefficient 


2.61 I 
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Corresponding solvent content 


52.5 % [ 


Refinement 








Resolution limits 


21.6 A 
1.46 A 




Number of reflections used 


64131 




with I > 1 sigma(I) 


62900 




with I > 3 sigma(I) 


32421 




Completeness 


95 % 




R-factor for all reflections 


0.196 




Correlation coefficient 


0.955 




Number of reflections above 2 
sigma(F) and resolution from 5.0 
A - high resolution limit 


61215 




used to calculate Rworking 


58117 




used to calculate Rfree 


3098 




R-factor without free reflections 


0.190 




R-factor for free reflections 


0.224 




Error in coordinates estimated by 
Luzzati plot 


0.187 A 


Validation 








Phi-Psi core region 


93.1 % 




Phi-Psi violations 

Residues in disallowed regions: 


1 


%bad 
contacts 


Short contact distances 


0 




RMSD from ideal bond length 


0.021 A 




RMSD from ideal bond angle 


1.88° 



Example 1.4: Structure Analyses 
[0253] Atomic superpositions are performed with MOE (available from Chemical 
Computing Group, Inc., Montreal, Quebec, Canada). Per residue solvent accessible 
surface calculations are done with GRASP (Nicholls et a/., "Protein folding and 
association: insights from the interfacial and thermodynamic properties of hydrocarbons," 
Proteins, 11:281-96, 1991). The electrostatic surface is calculated using a probe radius of 
1.4A. 

Example 2: Use of PAK6KP Coordinates for Inhibitor Design 
[0254] The coordinates of the present invention, including the coordinates of molecules 
comprising the binding pocket residues of Figure 4, as well as coordinates of homologs 
having a rmsd of the backbone atoms of preferably less than 1.5 A, more preferably less 
than 1.25 A, more preferably less than lA, more preferably less than 0.75 A, and more 
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preferably less than 0.5A from the coordinates of Figure 4, are used to design compounds, 
including inhibitory compounds, that associate with PAK6, or homologs of PAK6. Such 
compounds may associate with PAK6 at the active site, in a binding pocket, in an 
accessory binding pocket, or in parts or all of both regions. 

[0255] The process may be aided by using a computer comprising a computer readable 
database, wherein the database comprises coordinates of an active site, binding pocket, or 
accessory binding pocket of the present invention. The computer may be programmed, for 
example, with a set of machine-executable instructions, wherein the recorded instructions 
are capable of displaying a three-dimensional representation of PAK6, or portions thereof. 
The computer is used according to the methods described herein to design compounds that 
associate with PAK6, for example, at the active site or a binding pocket. 
[0256] A chemical compound library is obtained. The library may be purchased from a 
publicly available source such as, for example, ChemBridge (San Diego, California, 
www.chembridge.com), Available Chemical Database, or Asinex (Moscow 123182, 
Russia, www.asinex.com). A filter is used to retain compounds in the library that satisfy 
the Lipinski rule of five, which states that compounds are likely to have good absorption 
and permeation in biological systems and are more likely to be successful drug candidates 
if they meet the following criteria: five or fewer hydrogen-bond donors, ten or fewer 
hydrogen-bond acceptors, molecular weight less than or equal to 500, and a calculated 
logP less than or equal to 5. (Lipinski, C.A., et aL, Advanced Drug Delivery Reviews 23 
3-25 (1996)). 

[0257] This filter reduces the size of the compound library used to screen against the 
structure of the present invention. Docking programs described herein, such as, for 
example, DOCK, or GOLD, are used to identify compounds that bind to the active site 
and/or binding pocket. Compounds may be screened against more than one binding 
pocket of the protein structure, or more than one set of coordinates for the same protein, 
taking into account different molecular dynamic conformations of the protein. Consensus 
scoring may then be used to identify the compounds that are the best fit for the protein 
(Charifson, P.S. et al., J. Med. Chem. 42:5100-9 (1999)). Data obtained from more than 
one protein molecule structure may also be scored according to the methods described in 
Klingler et al., U.S. Utility Application, filed May 3, 2002, entitled "Computer Systems 
and Methods for Virtual Screening of Compounds." Compounds having the best fit are 
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then obtained from the producer of the chemical library, or synthesized, and used in 
binding assays and bioassays. 

[0258] The coordinates of the present invention are also used to determine 
pharmacophores. These pharmacophores may be designed after reviewing results from 
the use of a docking program, to determine the shape of the PAK6 pharmacophore. 
Alternatively, programs such as GRID are used to calculate the properties of a 
pharmacophore. Once the pharmacophore is determined, it may be used to screen 
chemical libraries for compounds that fit within the pharmacophore. 
[0259] The coordinates of the present invention are also used to identify substructures 
that interact with various portions of an active site or binding pocket of PAK6. Once a 
substructure, or set of substructures, is determined, it is used to screen a chemical library 
for compounds comprising the substructure or set of substructures. The identified 
compounds are then docked to, for example, the active site or binding pocket. 

Example 3: Bioassav 
[0260] The kinase assays may use various forms of PAK6KD and PAK6, including, 
for example, PAK6KD or the PAK6 molecule itself, or a portion thereof. 

Binding Assay: 

[0261] To measure the binding of a compound to PAK6 or PAK6KD, a test kit 
manufactured by Discoverx (Fremont, CA), ED-Staurosporine NSIP™ Enzyme Binding 
Assay Kit (see U.S. Patent No. 5,643,734) is used. 

PAK4 Kinase assay: 

[0262] To assay the kinase activity of PAK6 or PAK6KD, NIH 3T3 cells are 

transfected with either empty SRa expression vector or expression vectors containing HA- 

tagged PAK6 or PAK6KD. Cells are harvested in M2 buffer (Minden, A. et al^Science, 

266:1719-23, 1994) 48 h after transfection. Approximately 100 fig of cell extracts are 

mixed with anti-HA antibody and protein A-Sepharose and incubated 2 h to overnight at 

4°C. The immune complexes are washed twice with M2 buffer and twice in 20 mM 

HEPES, pH 7.5, and incubated in a kinase buffer containing 20 fiM ATP and 5 fid of [Y- 

32 P]ATP together with either 5 fig of histone H4 or MBP (Boehringer Mannheim) or no 

substrate, at 30°C for 20 min. The reaction is stopped by boiling in 4x SDS loading buffer. 
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Proteins are resolved by SDS-PAGE, and substrate phosphorylation and 
autophosphorylation are visualized by autoradiography. 

[0263] To measure autophosphorylation of purified PAK6 or PAK6KD, recombinant 
PAK6 or PAK6KD (2 fig bound to protein G-Sepharose conjugated with monoclonal glu- 
glu antibody) is washed once and incubated in 40 fil of kinase buffer (50 mM Tris-HCl, 
pH 7.5, 100 mM NaCI, 10 mM MgCl 2 , 1 mM MnCl 2 ) with 2 /ig of either Racl or 
Cdc42Hs, all previously loaded with GTP or GDP. The reaction is initiated by adding 
10 ill of kinase buffer containing 50 fiM ATP and 5 fid of [Y- 32 P] ATP and incubated for 
20 min at 30°C. The reaction is stopped by adding 10 /il of 5x SDS-PAGE sample buffer 
and boiling for 5 min. Samples are applied to a 14% SDS-PAGE gel and exposed to film. 
[0264] To measure modulation, activation, or inhibition of PAK6 or PAK6KD kinase 
activity, a test compound is added to the assay at a range of concentrations. Inhibitors 
may, for example, inhibit PAK6 or PAK6KD activity at an IC 50 under 100 yM, for 
example under 10 pM, for example, under 1 mM, in the nanomolar range, or, for example, 
in the sub-nanomolar range. 

Apoptosis Assay: 

[0265] The ability of PAK6or PAK6KD to protect cells against apoptosis and delay 
caspase activation is measured as described (Gnesutta, N., et aL, J. Biol. Chem., 276: 
14414-19, 2001). NIH3T3 cells are stably transfected with either control, wild-type, or 
mutant PAK6 or PAK6KD vector (Qu, J. et al., Mol Cell Biol, 10: 3523-33, 2001). To 
estimate the level of apoptosis and survival of stably transfected cell lines, equal numbers 
of cells are seeded in growth medium in 3.5-, 6-, or 10-cm plates. After 48 hours, cells are 
assayed for apoptosis induction following treatments with UV irradiation, tumor necrosis 
factor alpha (TNFa), and serum deprivation. For UV irradiation, cells are washed twice in 
phosphate-buffered saline (PBS). After removal of the PBS, cells are exposed to 50 J/m 2 
UV-light in a UV cross-linker (Fisher) followed by addition of fresh medium. For TNFa 
treatment, cells are washed once with fresh medium that was replaced by medium 
containing TNFa and cycloheximide (CHX) either alone or in combination at a 
concentration of 10 ng/ml and 10/xg/ml, respectively (CHX is used as a control to block 
the NF-kB-mediated survival pathway induced by TNFa). For serum deprivation 
experiments, cells are washed once with medium without serum, followed by addition of 
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fresh medium containing 0.1, 0.5, or 10% serum for 24 h. After stimulation, cells are 
collected at 1 h, 2 h, and 4 h intervals and fixed for flow cytometry analysis or used to 
prepare total cell extracts. To determine the percentage of cell death by flow cytometry 
analysis, cells with DNA content lower than the Gi peak (sub-Gi) are considered to be 
apoptotic. Caspase-3-like activity is examined by Western blot analysis of the caspase-3 
substrate PARP in equal amounts of cell lysate. Detection of the M T 85,000 proteolytic 
product of PARPis used as an indication of caspase activity. To examine nuclear 
condensation, cells stained with Hoechst 33258 are analyzed by fluorescence microscopy. 
Cells that displayed condensed chromatin and blebbed nuclei are considered apoptotic. To 
determine the amount of cell death in each clone, apoptotic cells are counted in the same 
number of viewing fields. To determine survival rates, cells are seeded in 6-well plates and 
treated as described above. At the indicated time point, the medium is aspirated, and 
floating cells are removed by washing twice with PBS. Attached cells are collected and 
counted. The survival rate is expressed in percentage of surviving cells in treated cells 
compared with the untreated control. 

[0266] To measure modulation, activation, or inhibition of PAK6KD, a test compound 
is added to the assay at a range of concentrations. Inhibitors may inhibit PAK6KD 
activity at an IC50 in the nanomolar range, and, for example, in the subnanomolar range. 
[0267] To measure modulation, activation, or inhibition of PAK6KD, a test compound 
is added to the assay at a range of concentrations. Preferred inhibitors inhibit PAK6KD 
activity at an IC50 in the nanomolar range, or in the subnanomolar range. 

Example 4: Formulation and Administration 
[0268] Pharmaceutical compositions comprising PAK6 modulators, such as inhibitors, 
are useful, for example, for treating diseases and conditions that may be alleviated by 
modulation of PAK6. Pharmaceutical compositions containing PAK6 effectors may also 
be used to modify the activity of human homologs of PAK6. 

[0269] In therapeutic and/or diagnostic applications, the compounds of the invention 
may be formulated for a variety of modes of administration, including systemic and 
topical or localized administration. Techniques and formulations generally may be found 
in Remington: The Science and Practice of Pharmacy (20 th ed.) Lippincott, Williams & 
Wilkins (2000). 
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[0270] The compounds according to the invention are effective over a wide dosage 
range. For example, in the treatment of adult humans, dosages from 0.01 to 1000 mg from 
0.5 to 100 mg, and from 1 to 50 mg per day, from 5 to 40 mg per day are examples of 
dosages that may be used. One example of a dosage is 10 to 30 mg per day. The exact 
dosage will depend upon the route of administration, the form in which the compound is 
administered, the subject to be treated, the body weight of the subject to be treated, and the 
preference and experience of the attending physician. 

[0271] Pharmaceutical^ acceptable salts are generally well known to those of ordinary 
skill in the art and may include, by way of example but not limitation, acetate, 
benzenesulfonate, besylate, benzoate, bicarbonate, bitartrate, bromide, calcium edetate, 
carnsylate, carbonate, citrate, edetate, edisylate, estolate, esylate, fumarate, gluceptate, 
gluconate, glutamate, glycollylarsanilate, hexylresorcinate, hydrabamine, hydrobromide, 
hydrochloride, hydroxynaphthoate, iodide, isethionate, lactate, lactobionate, malate, 
maleate, mandelate, mesylate, mucate, napsylate, nitrate, pamoate (embonate), 
pantothenate, phosphate/diphosphate, polygalacturonate, salicylate, stearate, subacetate, 
succinate, sulfate, tannate, tartrate, or teoclate. Other pharmaceutically acceptable salts 
may be found in, for example, Remington: The Science and Practice of Pharmacy (20 th 
ed.) Lippincott, Williams 8c Wilkins (2000). Preferred pharmaceutically acceptable salts 
include, for example, acetate, benzoate, bromide, carbonate, citrate, gluconate, 
hydrobromide, hydrochloride, maleate, mesylate, napsylate, pamoate (embonate), 
phosphate, salicylate, succinate, sulfate, or tartrate. 

[0272] Depending on the specific conditions being treated, such agents may be 

formulated into liquid or solid dosage forms and administered systemically or locally. The 

agents may be delivered, for example, in a timed- or sustained- low release form as is 

known to those skilled in the art. Techniques for formulation and administration may be 

found in Remington: The Science and Practice of Pharmacy (20 th ed.) Lippincott, 

Williams & Wilkins (2000). Suitable routes may include oral, buccal, sublingual, rectal, 

transdermal, vaginal, transmucosal, nasal or intestinal administration; parenteral delivery, 

including intramuscular, subcutaneous, intramedullary injections, as well as intrathecal, 

direct intraventricular, intravenous, intraperitoneal, intranasal, or intraocular injections. 

[0273] For injection, the agents of the invention may be formulated in aqueous 

solutions, such as in physiologically compatible buffers such as Hank's solution, Ringer's 

solution, or physiological saline buffer. For such transmucosal administration, penetrants 
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appropriate to the barrier to be permeated are used in the formulation. Such penetrants are 
generally known in the art. Use of pharmaceutical^ acceptable carriers to formulate the 
compounds herein disclosed for the practice of the invention into dosages suitable for 
systemic administration is within the scope of the invention. With proper choice of carrier 
and suitable manufacturing practice, the compositions of the present invention, in 
particular, those formulated as solutions, may be administered parenterally, such as by 
intravenous injection. The compounds may be formulated readily using pharmaceutical^ 
acceptable carriers well known in the art into dosages suitable for oral administration. 
Such carriers enable the compounds of the invention to be formulated as tablets, pills, 
capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a 
patient to be treated. 

[0274] Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to 
achieve its intended purpose. Determination of the effective amounts is well within the 
capability of those skilled in the art, especially in light of the detailed disclosure provided 
herein. 

[0275] In addition to the active ingredients, these pharmaceutical compositions may 
contain suitable phannaceutically acceptable carriers comprising excipients and auxiliaries 
which facilitate processing of the active compounds into preparations which may be used 
pharmaceutical^. The preparations formulated for oral administration may be in the form 
of tablets, dragees, capsules, or solutions. 

[0276] Pharmaceutical preparations for oral use may be obtained by combining the 
active compounds with solid excipients, optionally grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain 
tablets or dragee cores. Suitable excipients are, in particular, fillers such as sugars, 
including lactose, sucrose, mannitol, or sorbitol; cellulose preparations, for example, 
maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl 
cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethyl-cellulose (CMC), 
and/or polyvinylpyrrolidone (PVP: povidone). If desired, disintegrating agents may be 
added, such as the cross-linked polyvinylpyrrolidone, agar, or alginic acid or a salt thereof 
such as sodium alginate. 

[0277] Dragee cores are provided with suitable coatings. For this purpose, concentrated 

sugar solutions may be used, which may optionally contain gum arabic, talc, 
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polyvinylpyrrolidone, carbopol gel, polyethylene glycol (PEG), and/or titanium dioxide, 
lacquer solutions, and suitable organic solvents or solvent mixtures. Dye-stuffs or 
pigments may be added to the tablets or dragee coatings for identification or to 
characterize different combinations of active compound doses. 

[0278] Pharmaceutical preparations that may be used orally include push-fit capsules 
made of gelatin, as well as soft, sealed capsules made of gelatin, and a plasticizer, such as 
glycerol or sorbitol. The push-fit capsules can contain the active ingredients in admixture 
with filler such as lactose, binders such as starches, and/or lubricants such as talc or 
magnesium stearate and, optionally, stabilizers. In soft capsules, the active compounds 
may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or 
liquid polyethylene glycols (PEGs). In addition, stabilizers may be added. 
[0279] The present invention is not to be limited in scope by the exemplified 
embodiments, which are intended as illustrations of single aspects of the invention. 
Indeed, various modifications of the invention in addition to those described herein will 
become apparent to those having skill in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of the 
invention. References cited throughout this application are examples of the level of skill 
in the art and are hereby incorporated by reference herein in their entirety, whether 
previously specifically incorporated or not. 
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1. A PAK6or PAK6KD protein, or a functional PAK6KD protein subunit, in 
crystalline form. 

2. The crystalline protein or functional protein subunit of claim 1, which is a 
heavy-atom derivative crystal. 

3. The crystalline protein or functional protein subunit of claim 2, in which 
PAK6KD protein is a mutant. 

4. The crystalline protein of claim 3, which is characterized by a set of 
structural coordinates that is substantially similar to the set of structural coordinates of Fig. 
4. 

5. A crystal comprising PAK6 protein and a ligand. 

6. A method of identifying a ligand that binds PAK6 protein, comprising; 

a) forming a co-crystal of a test ligand and PAK6 protein; 

b) analyzing said co-crystal using X-ray crystallography; and 

c) using said analysis to determine whether said test ligand binds 
PAK6 protein. 

7. The method of claim 6 wherein said co-crystal is obtained by soaking a PAK6 
protein crystal in a solution comprising said test ligand. 

8. The method of claim 7 wherein said co-crystal is obtained by co-crystallizing 
PAK6 protein in the presence of said test ligand. 

9. A machine-readable medium embedded with information that corresponds 
to a three-dimensional structural representation of a crystalline protein of claim 1. 

10. The machine-readable medium of claim 9, embedded with the molecular 

structural coordinates of Fig. 4, or at least 50% of the coordinates thereof. 
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1 1 . The machine-readable medium of claim 9, embedded with the molecular 
structural coordinates of Fig. 4, or at least 80% of the coordinates thereof. 

12. The machine-readable medium of claim 9, embedded with the molecular 
structural coordinates of a protein molecule comprising a PAK6KD protein binding 
pocket, wherein said binding pocket comprises at least three amino acids selected from the 
group consisting of Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, 
Ser543, Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, 
Met438, Phe483, Asp530, Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, 
Ala488, Asp491, Gly546, Phe547, Arg675, and Lys676 having the structural coordinates 
of Fig. 4, or by the structural coordinates of a binding pocket homolog, wherein said the 
root mean square deviation of the backbone atoms of the amino acid residues of said 
binding pocket and said binding pocket homolog is less than 2.0A. 

13. The machine-readable medium of claim 12, wherein said binding pocket 
comprises Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, Ser543, 
and Asp544 according to the sequence of Fig. 4. 

14. The machine-readable medium of claim 13, wherein said binding pocket 
further comprises Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ue420, Ala434, 
Met438, Phe483, and Asp530 according to the sequence of Fig. 4. 

15. The machine-readable medium of claim 14, wherein said binding pocket 
further comprises Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, Ala488, 
Asp491, Gly546, Phe547, Arg675, and Lys676 according to the sequence of Fig. 4. 

16. A method of electronically transmitting all or part of the information stored 
in the machine-readable medium of claim 9. 

17. A method of producing a computer readable database comprising the three- 
dimensional molecular structural coordinates of a binding pocket of a PAK6KD protein, 
said method comprising 
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a) obtaining three-dimensional structural coordinates defining said protein or 
a binding pocket of said protein, from a crystal of said protein; and 

b) introducing said structural coordinates into a computer to produce a 
database containing the molecular structural coordinates of said protein or said binding 
pocket. 

18. The method of claim 17, wherein said binding pocket of said protein is part 
of a co-complex with at least one ligand. 

19. The method of claim 17 wherein said binding pocket comprises amino 
acids Val, Lys, Glu, Val, Met, Glu, Leu, Leu, Ser, and Asp. 

20. The method of claim 19 wherein said computer is capable of utilizing or 
displaying a three-dimensional molecular structure comprising said binding pocket using 
said structural coordinates. 

21. The method of claim 19 wherein said binding pocket further comprises 
amino acids corresponding to Gly, Glu, Gly, Ser, Thr, Gly, He, Ala, Met, Phe, and Asp. 

22. The method of claim 21 wherein said binding pocket further comprises 
amino acids corresponding to Leu, Gin, Arg, Leu, Gin, Gly, Gly, Ala, Asp, Gly, Phe, Arg, 
and Lys. 

23. The method of claim 17 wherein said binding pocket comprises a binding 
pocket defined by the structural coordinates of at least three amino acids selected from the 
group consisting of Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, 
Ser543, Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, 
Met438, Phe483, Asp530, Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, 
Ala488, Asp491, Gly546, Phe547, Arg675, and Lys676. 

24. The method of claim 23 wherein said binding pocket comprises Val421, 

Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, Ser543, and Asp544 

according to the sequence of Fig. 4. 
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25. The method of claim 24, wherein said binding pocket further comprises 
Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, Met438, Phe483, and 
Asp530 according to the sequence of Fig. 4. 

26. The method of claim 25, wherein said binding pocket further comprises 
Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Giy487, Ala488, Asp491, Gly546, 
Phe547, Arg675, and Lys676 according to the sequence of Fig. 4. 

27. The method of claim 17, wherein said binding pocket comprises an active 

site. 

28. A computer readable database produced by claim 17. 

29. A method comprising electronic transmission of all or part of the computer 
readable database produced by claim 17. 

30. A method of producing a computer readable database comprising a 
representation of a compound capable of binding a binding pocket of a PAK6KD protein, 
said method comprising 

a) introducing into a computer program a computer readable database 
produced by claim 17; 

b) generating a three-dimensional representation of a binding pocket of said 
PAK6KD protein in said computer program; 

c) superimposing a three-dimensional model of at least one binding test 
compound on said representation of the binding pocket; 

d) assessing whether said test compound model fits spatially into the binding 
pocket of said PAK6KD protein; and 

e) storing a representation of a compound that fits into the binding pocket into 
a computer readable database. 

31. The method of claim 30 wherein in e), said representation is stored in the 

database produced by claim 17. 
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32. The method of claim 30, wherein said representation is selected from the 
group consisting of the compound's name, a chemical or molecular formula of the 
compound, a chemical structure of the compound, an identifier for the compound, and 
three-dimensional molecular structural coordinates of the compound. 

33. The method of claim 30, wherein said generating of a three-dimensional 
representation of the binding pocket comprises use of structural coordinates having a root 
mean square deviation of the backbone atoms of the amino acid residues of said binding 
pocket of less than 2.0A from the structural coordinates of the corresponding residues 
according to Fig. 4. 

34. The method of claim 30, wherein said at least one binding test compound is 
selected by a method selected from i) selecting a compound from a small molecule 
database, (ii) modifying a known inhibitor, substrate, reaction intermediate, or reaction 
product, or a portion thereof, of PAK6KD, (iii) assembling chemical fragments or groups 
into a compound, and (iv) de novo ligand design of said compound. 

35. The method of claim 30, wherein said assessing of whether a test 
compound model fits is by docking the model to said representation of said PAK6KD 
binding pocket and/or performing energy minimization. 

36. The method of claim 30 further comprising 

f) preparing a binding test compound represented in said computer readable 
database; 

g) contacting said compound in a binding assay with a protein comprising said 
PAK6KD protein binding pocket; 

h) determining whether said test compound binds to said protein in said assay; 

and 

i) introducing a representation of a compound that binds to said protein in 
said assay into a computer readable database. 
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37. The method of claim 35 wherein in i), said representation is stored in said 
database. 

38. The method of claim 35, wherein said representation is selected from the 
group consisting of the compound's name, a chemical formula of the compound, a 
chemical structure of the compound, an identifier for the compound, and three- 
dimensional molecular structural coordinates of the compound. 

39. A method of producing a computer readable database comprising a 
representation of a binding pocket of a PAK6KD protein in a co-crystal with a compound, 
said method comprising 

a) preparing a binding test compound represented in a computer readable 
database produced by claim 30; 

b) forming a co-crystal of said compound with a protein comprising a binding 
pocket of a PAK6KD protein; 

c) obtaining the structural coordinates of said binding pocket in said co- 
crystal; and 

d) introducing the structural coordinates of said binding pocket or said co- 
crystal into a computer-readable database. 

40. The method of claim 39, further comprising introducing the structural 
coordinates of said compound in said co-crystal into said database. 

41. The method of claim 30 wherein said binding pocket comprises amino 
acids Val, Lys, Glu, Val, Met, Glu, Leu, Leu, Ser, and Asp. 

42. The method of claim 41 wherein said computer is capable of utilizing or 
displaying a three-dimensional molecular structure of said binding pocket using said 
structural coordinates. 

43. The method of claim 42 wherein said binding pocket further comprises 
amino acids corresponding to Gly, Glu, Gly, Ser, Thr, Gly, lie, Ala, Met, Phe, and Asp. 
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44. The method of claim 43 wherein said binding pocket further comprises 
amino acids corresponding to Leu, Gin, Arg, Leu, Gin, Gly, Gly, Ala, Asp, Gly, Phe, Arg, 
and Lys. 

45. The method of claim 30 wherein said binding pocket comprises a binding 
pocket defined by the structural coordinates of at least three amino acids selected from the 
group consisting of Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, 
Ser543, Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, 
Met438, Phe483, Asp530, Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, 
Ala488, Asp491, Gly546, Phe547, Arg675, and Lys676. 

46. The method of claim 45 wherein said binding pocket comprises Val421, 
Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, Ser543, and Asp544 
according to the sequence of Fig. 4. 

47. The method of claim 46, wherein said binding pocket further comprises 
Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, Met438, Phe483, and 
Asp530 according to the sequence of Fig. 4. 

48. The method of claim 47, wherein said binding pocket further comprises 
Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, Ala488, Asp491, Gly546, 
Phe547, Arg675, and Lys676 according to the sequence of Fig. 4. 

49. The method of claim 30, wherein said binding pocket comprises an active 

site. 

50. A computer readable database produced by claim 30. 

51. A method comprising electronic transmission of all or part of the computer 
readable database produced using the method of claim 17. 
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52. A method of modulating PAK6KD protein activity comprising contacting 
said PAK6KD with a compound, wherein said compound is represented in a database 
produced by the method of claim 30. 

53. A method of producing a compound comprising a three-dimensional 
molecular structure represented by the coordinates contained in a computer readable 
database produced by claim 30 comprising synthesizing said compound wherein said 
compound binds in a binding pocket of PAK6KD protein. 

54. A method of modulating PAK6KD protein activity, comprising contacting 
said PAK6KD protein with a compound produced by claim 53. 

55. A method of identifying an activator or inhibitor of a protein that comprises 
a PAK6KD active site or binding pocket, comprising 

a) producing a compound according to claim 53; 

b) contacting said compound with a protein that comprises a PAK6KD active 
site or binding pocket; and 

c) determining whether the potential modulator activates or inhibits the 
activity of said protein. 

56. A method of producing an activator or inhibitor identified by claim 55. 

57. A method of producing a computer readable database comprising a 
representation of a compound rationally designed to be capable of binding a binding 
pocket of a PAK6KD protein, said method comprising 

a) introducing into a computer program a computer readable database 
produced by claim 17; 

b) generating a three-dimensional representation of the protein or a binding 
pocket of said PAK6KD protein in said computer program; 

c) designing a three-dimensional model of a compound that forms non- 
covalent bonds with amino acids of a binding pocket of said representation; and 

d) storing a representation of said compound into a computer readable 
database. 
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58. The method of claim 58, wherein said representation is selected from the 
group consisting of the compound's name, a chemical or molecular formula of the 
compound, a chemical structure of the compound, an identifier for the compound, and 
three-dimensional structural coordinates of the compound. 

59. The method of claim 57 further comprising 

e) preparing a binding test compound comprising a three-dimensional 
molecular structure represented by the coordinates contained in said computer readable 
database; 

f) contacting said compound in a binding assay with a protein comprising said 
binding pocket of a PAK6KD protein; 

g) determining whether said test compound binds to said protein in said assay; 

and 

h) introducing a representation of a compound that binds to said protein in 
said assay into a computer-readable database. 

60. The method of claim 59, wherein said representation is selected from the 
group consisting of the compound's name, a chemical or molecular formula of the 
compound, a chemical structure of the compound, an identifier for the compound, and 
three-dimensional structural coordinates of the compound. 

61. A method of producing a computer readable database comprising a 
representation of a binding pocket of a PAK6KD protein in a co-crystal with a compound 
rationally designed to be capable of binding said binding pocket, said method comprising 

a) preparing a binding test compound represented in a computer readable 
database produced by claim 57; 

b) forming a co-crystal of said compound with a protein comprising a binding 
pocket of a PAK6KD protein; 

c) obtaining the structural coordinates of said binding pocket in said co- 
crystal; and 

d) introducing the structural coordinates of said binding pocket or said co- 
crystal into a computer-readable database. 
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62. The method of claim 61, further comprising introducing the structural 
coordinates of said compound in said co-crystal into said database. 

63. The method of claim 57 wherein said binding pocket comprises amino 
acids Val, Lys, Glu, Val, Met, Glu, Leu, Leu, Ser, and Asp. 

64. The method of claim 63 wherein said binding pocket further comprises 
amino acids corresponding to Gly, Glu, Gly, Ser, Thr, Gly, He, Ala, Met, Phe, and Asp. 

65. The method of claim 64 wherein said binding pocket further comprises 
amino acids corresponding to Leu, Gin, Arg, Leu, Gin, Gly, Gly, Ala, Asp, Gly, Phe, Arg, 
and Lys. 

66. The method of claim 57 wherein said binding pocket comprises a binding 
pocket defined by the structural coordinates of at least three amino acids selected from the 
group consisting of Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, 
Ser543, Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, 
Met438, Phe483, Asp530, Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, 
Ala488, Asp491, Gly546, Phe547, Arg675, and Lys676. 

67. The method of claim 66 wherein said binding pocket comprises Val421, 
Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, Ser543, and Asp544 
according to the sequence of Fig. 4. 

68. The method of claim 67, wherein said binding pocket further comprises 
Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, Met438, Phe483, and 
Asp530 according to the sequence of Fig. 4. 

69. The method of claim 68, wherein said binding pocket further comprises 
Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, Ala488, Asp491, Gly546, 
Phe547, Arg675, and Lys676 according to the sequence of Fig. 4. 
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70. The method of claim 57, wherein said binding pocket comprises an active 

site. 

71. A computer readable database produced using the method of claim 57. 

72 A method comprising electronic transmission of all or part of the computer 
readable database produced using the method of claim 57. 

73 A method of producing a computer readable database comprising structural 
information about a molecule or a molecular complex of unknown structure comprising: 

a) generating an x-ray diffraction pattern from a crystallized form of said 

molecule or molecular complex; 

b) using a molecular replacement method to interpret the structure of said 

molecule; wherein said molecular replacement method uses the structural coordinates of a 
crystalline protein of claim 1, or the structural coordinates of Fig. 4, or a subset thereof 
comprising a binding pocket, the structural coordinates of a binding pocket of Fig. 4, or 
structural coordinates having a root mean square deviation for the alpha-carbon atoms of 
said structural coordinates of less than 2.0A; and 

c) storing the coordinates of the resulting structure in a computer readable 
database. 

74. The method of claim 73 wherein said binding pocket comprises a binding 
pocket defined by the structural coordinates of at least three amino acids selected from the 
group consisting of Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, 
Ser543, Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, 
Met438, Phe483, Asp530, Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, 
Ala488, Asp491, Gly546, Phe547, Arg675, and Lys676. 

75. The method of claim 74 wherein said binding pocket comprises Val421, 
Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, Ser543, and Asp544 
according to the sequence of Fig. 4. 
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76. The method of claim 75, wherein said binding pocket further comprises 
Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, Met438, Phe483, and 
Asp530 according to the sequence of Fig. 4. 

77. The method of claim 76, wherein said binding pocket further comprises 
Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, Ala488, Asp491, Gly546, 
Phe547, Arg675, and Lys676 according to the sequence of Fig. 4. 

78. The method of claim 73, wherein said binding pocket comprises an active site. 

79. A computer readable database produced using the method of claim 73. 

80. A method comprising electronic transmission of all or part of the computer 
readable database produced by claim 73. 

81. A method for homology modeling the structure of a PAK6KD protein 
homolog comprising: 

a) aligning the amino acid sequence of a PAK6KD protein homolog with an 
amino acid sequence of PAK6KD protein; 

b) incorporating the sequence of the PAK6KD protein homolog into a model 
of the structure of PAK6KD protein, wherein said model has the same structural 
coordinates as the structural coordinates of a crystalline protein of claim 1, or the 
structural coordinates of Fig. 4, or wherein the structural coordinates of said model's 
alpha-carbon atoms have a root mean square deviation from the structural coordinates of 
Fig. 4, of less than 2.0A to yield a preliminary model of said homolog; 

c) subjecting the preliminary model to energy minimization to yield an energy 
minimized model; and 

d) remodeling regions of the energy minimized model where stereochemistry 
restraints are violated to yield a final model of said homolog. 

82. A method for identifying a compound that binds PAK6KD protein 
comprising: 
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a) providing a computer modeling program with a set of structural coordinates 
or a three dimensional conformation for a molecule that comprises a binding pocket of a 
crystalline protein of claim 1, or a homolog thereof; 

b) providing a said computer modeling program with a set of structural 
coordinates of a chemical entity; 

c) using said computer modeling program to evaluate the potential binding or 
interfering interactions between the chemical entity and said binding pocket; and 

d) determining whether said chemical entity potentially binds to or interferes 
with said protein or homolog. 

83. The method of claim 82 further comprising the steps of: 

e) computationally modifying the structural coordinates or three dimensional 
conformation of said chemical entity to improve the likelihood of binding to said binding 
pocket; and 

b) determining whether said modified chemical entity potentially binds to or 
interferes with said protein or homolog. 

84. The method of claim 82 wherein determining whether the chemical entity 
potentially binds to said molecule comprises performing a fitting operation between the 
chemical entity and a binding pocket of the protein or homolog; and computationally 
analyzing the results of the fitting operation to quantify the association between, or the 
interference with, the chemical entity and the binding pocket. 

85. The method of claim 82 wherein a library of structural coordinates of 
chemical entities is used to identify a compound that binds. 

86. A method for designing a compound that binds PAK6KD protein 
comprising: 

a) providing a computer modeling program with a set of structural 
coordinates, or a three dimensional conformation derived therefrom, for a molecule that 
comprises a binding pocket comprising the structural coordinates of a binding pocket of a 
crystalline protein of claim 1, or a homolog thereof; 
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b) computationally building a chemical entity represented by set of structural 
coordinates; and 

c) determining whether the chemical entity is expected to bind to said 
molecule. 

87. The method of claim 86, wherein determining whether the chemical entity 
potentially binds to said molecule comprises performing a fitting operation between the 
chemical entity and a binding pocket of the molecule; and 

computationally analyzing the results of the fitting operation to quantify the 
association between the chemical entity and the binding pocket. 

88. The method of claim 86 wherein said binding pocket comprises a binding 
pocket defined by the structural coordinates of at least three amino acids selected from the 
group consisting of Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, 
Ser543, Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, 
Met438, Phe483, Asp530, Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, 
Ala488, Asp491, Gly546, Phe547, Arg675, and Lys676. 

89. The method of claim 88 wherein said binding pocket comprises Val421, 
Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, Ser543, and Asp544 
according to the sequence of Fig. 4. 

90. The method of claim 89, wherein said binding pocket further comprises 
Gly414, Glu415, Gly416, Ser417, Thr418, GIy419, Ile420, Ala434, Met438, Phe483, and 
Asp530 according to the sequence of Fig. 4. 

91. The method of claim 90, wherein said binding pocket further comprises 
Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, Ala488, Asp491, Gly546, 
Phe547, Arg675, and Lys676 according to the sequence of Fig. 4. 

92. The method of claim 86, wherein said binding pocket comprises an active site. 
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93. A method of producing a mutant PAK6KD protein, having an altered 
property relative to PAK6KD protein, comprising, 

a) constructing a three-dimensional structure of PAK6KD protein having 
structural coordinates selected from the group consisting of the structural coordinates of a 
crystalline protein of claim 1, the structural coordinates of Fig. 4, and the structural 
coordinates of a protein having a root mean square deviation of the alpha carbon atoms of 
said protein of less than 2.0 A when compared to the structural coordinates of Fig. 4; 

b) using modeling methods to identify in the three-dimensional structure at 
least one structural part of the PAK6KD protein molecule wherein an alteration in said 
structural part is predicted to result in said altered property; 

c) providing a nucleic acid molecule coding for a PAK6KD mutant protein 
having a modified sequence that encodes a deletion, insertion, or substitution of one or 
more amino acids at a position corresponding to said structural part; and 

d) expressing said nucleic acid molecule to produce said mutant; 
wherein said mutant has at least one altered property relative to the parent. 

94. A method of producing a mutant PAK6KD protein, having an altered 
property relative to PAK6KD protein, comprising, 

a) constructing a three-dimensional structure of a molecule comprising a 
binding pocket, wherein said binding pocket comprises at least three amino acids selected 
from the group consisting of Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, 
Leu533, Ser543, Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, 
Ala434, Met438, Phe483, Asp530 , Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, 
Gly487, Ala488, Asp491, Gly546, Phe547, Arg675, and Lys676 having the structural 
coordinates of a crystalline protein of claim 1, the structural coordinates of Fig. 4, or the 
structural coordinates of a binding pocket homolog, wherein said the root mean square 
deviation of the backbone atoms of the amino acid residues of said binding pocket and 
said binding pocket homolog is less than 2.0A; 

b) using modeling methods to identify in the three-dimensional structure at 
least one portion of said binding pocket wherein an alteration in said portion is predicted 
to result in said altered property; 
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c) providing a nucleic acid molecule coding for a mutant PAK6KD protein 
having a modified sequence that encodes a deletion, insertion, or substitution of one or 
more amino acids at a position corresponding to said portion; and 

d) expressing said nucleic acid molecule to produce said mutant; 
wherein said mutant has at least one altered property relative to the parent. 

95. A method of producing a computer readable database containing the three- 
dimensional molecular structural coordinates of a compound capable of binding the active 
site or binding pocket of a protein molecule, said method comprising 

a) introducing into a computer program a computer readable database 
produced by claim 17; 

b) generating a three-dimensional representation of the active site or binding 
pocket of said PAK6KD protein in said computer program; 

c) superimposing a three-dimensional model of at least one binding test 
compound on said representation of the active site or binding pocket; 

d) assessing whether said test compound model fits spatially into the active 
site or binding pocket of said PAK6KD protein; 

e) assessing whether a compound that fits will fit a three-dimensional model 
of another protein, the structural coordinates of which are also introduced into said 
computer program and used to generate a three-dimensional representation of the other 
protein; and 

f) storing the three-dimensional molecular structural coordinates of a model 
that does not fit the other protein into a computer readable database. 

96. A method for determining whether a compound binds PAK6KD protein, 
comprising, 

a) providing a computer modeling program with a set of structural coordinates 
or a three dimensional conformation for a molecule that comprises a binding pocket of a 
crystalline protein of claim 1, PAK6KD protein, or a homolog thereof; 

b) providing a said computer modeling program with a set of structural 
coordinates of a chemical entity; 

c) using said computer modeling program to evaluate the potential binding or 
interfering interactions between the chemical entity and said binding pocket; and 
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d) determining whether said chemical entity potentially binds to or interferes 
with said protein or homolog. 

97. A method of producing a computer readable database comprising a 
representation of a compound capable of binding a binding pocket of a PAK6KD protein, 
said method comprising, 

a) introducing into a computer program a computer readable database 
produced by claim 17; 

b) determining a pharmacophore that fits within said binding pocket; 

c) computationally screening a plurality of compounds to determine which 
compound(s) or portion(s) thereof fit said pharmacophore; and 

d) storing a representation of said compound(s) or portion(s) thereof into a 
computer readable database. 

98. The method of claim 97, wherein said representation is selected from the 
group consisting of the compound's name, a chemical or molecular formula of the 
compound, a chemical structure of the compound, an identifier for the compound, and 
three-dimensional molecular structural coordinates of the compound. 

99. The method of claim 97 wherein said binding pocket comprises a binding 
pocket defined by the structural coordinates of at least three amino acids selected from the 
group consisting of Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, 
Ser543, Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, 
Met438, Phe483, Asp530, Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, 
Ala488, Asp491, GIy546, Phe547, Arg675, and Lys676. 

100. The method of claim 99 wherein said binding pocket comprises Val421, 
Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, Ser543, and Asp544 
according to the sequence of Fig. 4. 

101 . The method of claim 100, wherein said binding pocket further comprises 
Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, Met438, Phe483, and 
Asp530 according to the sequence of Fig. 4. 
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102. The method of claim 101 , wherein said binding pocket further comprises 
Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, Ala488, Asp491, Gly546, 
Phe547, Arg675, and Lys676 according to the sequence of Fig. 4. 

103. The method of claim 97, wherein said binding pocket comprises an active 

site. 

104. A computer readable database produced using the method of claim 97. 

105. A method comprising electronic transmission of all or part of the computer 
readable database produced using the method of claim 97. 

106. A method of producing a computer readable database comprising a 
representation of a compound capable of binding a binding pocket of a PAK6KD protein, 
said method comprising 

a) introducing into a computer program a computer readable database 
produced by claim 17; 

b) determining a chemical moiety that interacts with said binding pocket; 

c) computationally screening a plurality of compounds to determine which 
compound(s)comprise said moiety as a substructure of said compound(s); and 

d) storing a representation of said compound(s) that comprise said 
substructure into a computer readable database. 

107. The method of claim 106, wherein said representation is selected from the 
group consisting of the compound's name, a chemical or molecular formula of the 
compound, a chemical structure of the compound, an identifier for the compound, and 
three-dimensional molecular structural coordinates of the compound. 

108. The method of claim 106 wherein said binding pocket comprises a binding 
pocket defined by the structural coordinates of at least three amino acids selected from the 
group consisting of Val421, Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, 
Ser543, Asp544, Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, 
Met438, Phe483, Asp530, Leu423, GIn443, Arg445, Leu449, Gln485, Gly486, Gly487, 
Ala488, Asp491, Gly546, Phe547, Arg675, and Lys676. 
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109. The method of claim 108 wherein said binding pocket comprises VaI421, 
Lys436, Glu452, Val465, Met481, Glu482, Leu484, Leu533, Ser543, and Asp544 
according to the sequence of Fig. 4. 

1 10. The method of claim 109, wherein said binding pocket further comprises 
Gly414, Glu415, Gly416, Ser417, Thr418, Gly419, Ile420, Ala434, Met438, Phe483, and 
Asp530 according to the sequence of Fig. 4. 

111. The method of claim 110, wherein said binding pocket further comprises 
Leu423, Gln443, Arg445, Leu449, Gln485, Gly486, Gly487, Ala488, Asp491, Gly546, 
Phe547, Arg675, and Lys676 according to the sequence of Fig. 4. 

1 12. The method of claim 106, wherein said binding pocket comprises an active 

site. 

113. A computer readable database produced using the method of claim 106. 

1 14. A method comprising electronic transmission of all or part of the computer 
readable database produced using the method of claim 106. 

115. Crystallizable PAK6 protein. 

116. A method of purifying PAK6 protein linked to a histidine tag comprising: 

a) obtaining a translation vector comprising a coding sequence for PAK6 
protein, linked to a histidine tag; 

b) performing size exclusion chromatography; and 

c) performing nickel chelating column chromatography. 

1 17. Purified PAK6KD polypeptide. 

1 18. The method of claim 1 17 wherein said polypeptide is 98% pure. 

1 19. The method of claim 1 17 wherein said polypeptide is unphosphorylated. 
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120. A method of purifying PAK6 polypeptide, comprising 
expressing PAK6 in insect cells; 

obtaining a soluble protein fraction from said insect cells; 

using a two column chromatograph procedure to obtain purified PAK6. 

121. An insect cell capable of expressing PAK6. 

122. The insect cell of claim 121, wherein said insect cell comprises a vector, 
wherein said vector comprises a nucleic acid sequence coding for PAK6. 

123. The use of the methods of any of claims 6-122 for drug discovery. 
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ABSTRACT 

The present invention provides machine readable media embedded with the three- 
dimensional molecular structure coordinates of PAK6KD and subsets thereof, including 
binding pockets, methods of using the structure to identify and design affecters, including 
inhibitors and activator, mutants of PAK6KD, PAK6KD crystals and compounds and 
compositions that affect PAK6 activity. 
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